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eategprjzation 

Ron Bekkerman, Ran El-Yaniv, Naftali Tishby, Yoad Winter 

March 2003 The Journal of Machine Learning Research, volume 3 

Full text available: "r%3 :xiff 176.53 KB) Additional Information: full citation , abstract , index terms 



We study an approach to text categorization that combines distributional clustering of words 
and a Support Vector Machine (SVM) classifier. This word-cluster representation is computed 
using the recently introduced Information Bottleneck method, which generates a compact 
and efficient representation of documents. When combined with the classification power of 
the SVM, this method yields high performance in text categorization. This novel combination 
of SVM witlrword-cluster representation ... 

2 CalegorM 
Janine Toole 

April 2000 Proceedings of the sixth conference on Applied natural language processing 



Full text available: 1 



:^f(691.92 KB; 



Additional Information: Ml .station, abstract, references 



Publisher S=te 



This paper introduces a system for categorizing unknown words. The system is based on a 
multicomponent architecture where each component is responsible for identifying one class 
of unknown words. The focus of this paper is the components that identify names and 
spelling errors. Each component uses a decision tree architecture to combine multiple types 
of evidence about the unknown word. The system is evaluated using data from live closed 
captions - a genre replete with a wide variety of unknown ... 

Aieksander Kolcz, Vidya Prabakarmurthi, Jugal Kalita 

October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 

Additional Information: full diation . abstract references , citings . Index 
terms 



Full text available: f|.£df(945J8.^ 



We address the problem of evaluating the effectiveness of summarization techniques for the 
task of document categorization. It is argued that for a large class of automatic 
categorization algorithms, extraction-based document categorization can be viewed as a 
particular form of feature selection performed on the full text of the document and, in this 
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context, its impact can be compared with state-of-the-art feature selection techniques 
especially devised to provide good categorization performanc ... 

4 Automatic categorization of case jaw 
Paul Thompson 

May 2001 Proceedings of the 8th international conference on Artificial intelligence 
and law 

i- ii* ^ i u. ^ .. rv Additional Information: full citation, abstract, references, chinas, index 
Full text available: TO c-sftfl4.b1 ?sBj ~ 



This paper describes a series of automatic text categorization experiments with case law 
documents. Cases are categorized into 40 broad, high-level categories. These results are 
compared to an existing operational process using Boolean queries manually constructed by 
domain experts. In this categorization process recall is considered more important than 
precision. This paper investigates three algorithms that potentially could automate this 
categorization process: 1) a nearest neighbor-like a ... 

Keywords: text categorization 
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.QiasMyjng.iext.docum 

Osmar R. ZaTane, Maria-Luiza Antonie 

January 2002 Australian Computer Science Communications , Proceedings of the 

thirteenth Australasian conference on Database technologies - Volume 5, 

Volume 24 Issue 2 

Additional Information: Mlcltation, abstract, Merer-ces, cltinas, index 



Full text available: .04 MB) 

term;? 

Automatic text categorization has always been an important application and research topic 
since the inception of digital documents. Today, text categorization is a necessity due to the 
very large amount of text documents that we have to deal with daily. Many techniques and 
algorithms for automatic text categorization have been devised and proposed in the 
literature. However, there is still much room for improving the effectiveness of these 
classifiers, and new models need to be examined. We prop ... 

Keywords: association rules, classification, text categorization, text mining 



Mining unstructured data 
Ronen Feldman 

August 1999 Tutorial notes of the fifth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: fll&dg 3.01MB ). Additional Information: MUiMion, iMexienris 



Message dass^ 

Stephan Busemann, Sven Schmeier, Roman G. Arens 

April 2000 Proceedings of the sixth conference on Applied natural language processing 

Full text available: t P hxif(677.54 KB) 

l=jT Additional Information: fvijj.cjtation, abstract, references 

Customer care in technical domains is increasingly based on e-mail communication, allowing 
for the reproduction of approved solutions. Identifying the customer's problem is often time- 
consuming, as the problem space changes if new products are launched. This paper 
describes a new approach to the classification of e-mail requests based on shallow text 
processing and machine learning techniques. It is implemented within an assistance system 
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for call center agents that is used in a commercial setti ... 

8 Demonstrations: CMedPort: a cross-regional Chinese medical porta! 

Yilu Zhou, Jialun Qin, Hsinchun Chen, Zan Huang, Yiwen Zhang, Wingyan Chung, Gang Wang 
May 2003 Proceedings of the 3rd ACM/IEEE-CS joint c nference n Digital libraries 

Full text available: ^ .pdfj[1Q6.63 KB). Additional Information: MLsjiation, abstract, references, index. teems 

CMedPort is a cross-regional Chinese medical Web portal developed in the AI Lab at the 
University of Arizona. We will demonstrate the major system functionalities. 

9 Facilitate 

Kenneth A. Graetz, Nicole Proulx, Cassie B. Barlow, Laura J. Pape 

November 1997 Proceedings of the international ACM SIGGROUP conference on 

Supporting group work : the integration challenge: the integration 

challenge 

Full text available: ^.rxijV jj}5 MB ■ Additional Information: M-Ciiation, references, index terms 



Keywords: distributed collaborator electronic brainstorming, idea generation, 
teleconferencing 



1 0 A .new jdasMi 

Orrin E. Taulbee, John T. Welch 

January 1966 Proceedings of the 1966 21st national conference 

Full text available: *g|pd[(457, 72.K.J3} Additional Information: MLsM&n, ^fe^^act, rgferences, icLdexterins 

One of the classical problems in science is that of classification. It has found its principal use 
as an assist in organizing knowledge and pinpointing areas for further investigation. In 
classifying a set of items, one must take into account both the similarities among items and 
the desired properties of the set of classes. Thus two fundamental questions arise. First, in 
what sense should the items be considered similar, and second, should the classes be a 
priori determine ... 

George Nagy 

July 1968 Communications of the ACM, volume n issue 7 

Full text available: ^j?dgl86.MBl Additional Information: fujiciiatjon, references, cjjia&s 



Keywords: character recognition, information retrieval, online reader, operator-controlled 
reader, pattern recognition, reading machine, text reading, text-image discrimination, 
unformatted text 



12 Text mining: finding nuggets In mountains of textual data 
Jochen Dorre, Peter Gerstl, Roland Seiffert 

August 1999 Proceedings of the fifth ACM SIGKDD international conference on 
Kn wledge discovery and data mining 

Full text available: ^ odf(430 i 79 KB). Additional Information: fvjjj. citation, references, cithers, index .terra 



Keywords: clustering, customer relationship management, feature extraction, text 
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categorization, text mining 



13 A.|>ra^icaj„hypMext cat | 
class information 

Hyo-Jung Oh, Sung Hyon Myaeng, Mann-Ho Lee 

July 2000 Proceedings of the 23rd annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: fB od»g74.31 KB: Additlonal ,nformation: MteMfcffi. abstract, mferenoes, citings, index 

As WWW grows at an increasing speed, a classifier targeted at hypertext has become in high 
demand. While document categorization is quite a mature, the issue of utilizing hypertext 
structure and hyperlinks has been relatively unexplored. In this paper, we propose a 
practical method for enhancing both the speed and the quality of hypertext categorization 
using hyperlinks. In comparison against a recently proposed technique that appears to be 
the only one of the kind, we obtained up to 18.5% of ... 

Keywords: text categorization 



1 4 A neucsMaL^ 

Leslie G. Valiant 

September 2000 Journal of the ACM (JACM), Volume 47 issue 5 

r- nt ^ ! u. is»i ,,.-yo^ ./ru Additional Information: full citation, abstract, references, cltinas, index 
Full text available: soft = *3.o6 KB; * ' ' -" , 

An architecture is described for designing systems that acquire and ma nipulate large 
amounts of unsystematized, or so-called commonsense, knowledge. Its aim is to exploit to 
the full those aspects of computational learning that are known to offer powerful solutions in 
the acquisition and maintenance of robust knowledge bases. The architecture makes explicit 
the requirements on the basic computational tasks that are to be performed and is designed 
to make this computationally tractable even ... 

Keywords: PAC learning, cognitive computation, computational learning, learning relations, 
nonmonotonic reasoning, robust reasoning 



1 5 Systems;.. A news U 
Philip J. Hayes, Laura E. Knecht, Monica J. Cellio 

February 1988 Proceedings of the second conference on Applied natural language 
processing 

Full text available: f apdf(747.12 KB:- 

Mf Additional Information: full citation, abstract, references, citings 

111 1 Publisher Site 

This paper describes a pilot version of a commercial application of natural language 
processing techniques to the problem of categorizing news stories into broad topic 
categories. The system does not perform a complete semantic or syntactic analyses of the 
input stories. Its categorizations are dependent on fragmentary recognition using pattern- 
matching techniques. The fragments it looks for are determined by a set of knowledge- 
based rules. The accuracy of the system is only slightly lower than ... 

1 6 Knowledge, H 
search, and. surnman 

Kathleen R. McKeown, Noemie Elhadad, Vasileios Hatzivassiloglou 

May 2003 Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries 
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Full text available: ^^dfQi8J.8.KBj Additional Information: Mixtion, abstract, references, citings, Index 

Despite the large amount of online medical literature, it can be difficult for clinicians to find 
relevant information at the point of patient care. In this paper, we present techniques to 
personalize the results of search, making use of the online patient record as a sophisticated, 
pre-existing user model. Our work in PERSIVAL, a medical digital library, includes methods 
for re-ranking the results of search to prioritize those that better match the patient record. 
It also generates summa ... 
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Elizabeth D. Liddy, Woojin Paik, Edmund S. Yu 

July 1994 ACM Transactions on Information Systems (TOIS), Volume 12 issue 3 

(— 1 1 , , . . ^ Additional Information: full citation, abstract, references, ciiin&s, Index 

Full text available: Wi scff 1.1 / MB) V 1 

^ terms., review 

The text categorization module described here provides a front-end filtering function for the 
larger DR-LINK text retrieval system [Liddy and Myaeing 1993]. The model evaluates a 
large incoming stream of documents to determine which documents are sufficiently similar 
to a profile at the broad subject level to warrant more refined representation and matching. 
To accomplish this task, each substantive word in a text is first categorized using a feature 
set based on the semantic Subject Field ... 



Keywords: semantic vectors, subject field coding 



1 8 inheritance .and .co 111 
Dan Flickinger, John Nerbonne 

September 1992 Computational Linguistics, volume 18 issue 3 

Full text available:™ Si 

^i^t[k ; .^J.^i/A.^ Additional Information: M .citation, abstract, references, citings 

Mechanisms for representing lexically the bulk of syntactic and semantic information for a 
language have been under active development, as is evident in the recent studies contained 
in this volume. Our study serves to highlight some of themost useful tools available for 
structured lexical representation, in particular (multiple) inheritance, default specification, 
and lexical rules. It then illustrates the value of these mechanisms in illuminating one corner 
of the lexicon involving an unusual k ... 

1 9 Ajs^opn aijo H 
Katherine M. Chudoba 

September 1999 ACM SIGMIS Database, volume 30 issue 3-4 

Full text available: ^^J.:S;3.:yiB). Additional Information: MLcMion, 'jbsiracL terms 

This paper describes a macro-level coding scheme to distinguish patterns that occur in 
groups using a group support system (GSS). The coding scheme has roots in adaptive 
structuration theory (AST) with its emphasis on how technology is appropriated or used, and 
discourse analysis that requires one consider the context of the larger discussion when 
analyzing textual data. The macro-level coding scheme revolves around junctures that occur 
during group meetings such as when a group either chooses t ... 

Keywords: GSS, adaptive structuration theory, coding scheme, discourse analysis, group 
support systems 
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to generating topic hierarchy for text segments 
Shui-Lung Chuang, Lee-Feng Chien 

November 2004 Proceedings of the Thirteenth ACM conference on Information and 
knowledge management 

Full text available: Ddf(35;L 23 KB). Additional Information: MLcjtatjon, abstract, references, index teems 

It is crucial in many information systems to organize short text segments, such as keywords 
in documents and queries from users, into a well-formed topic hierarchy. In this paper, we 
address the problem of generating topic hierarchies for diverse text segments with a general 
and practical approach that uses the Web as an additional knowledge source. Unlike long 
documents, short text segments typically do not contain enough information to extract 
reliable features. This work investigates the p ... 

Keywords: clustering, partitioning, search-result snippet, text segment, topic hierarchy 
generation, web data mining 



2 RCViiA.New.BMlch 

David D. Lewis, Yiming Yang, Tony G. Rose, Fan Li 

August 2004 The Journal of Machine Learning Research, Volume 5 

Full text available: ^J.pdfi62S t 2S. KB) Additional Information: feLcjiatjojL abstract, citings, jngex .terms 

Reuters Corpus Volume I (RCV1) is an archive of over 800,000 manually categorized 
newswire stories recently made available by Reuters, Ltd. for research purposes. Use of this 
data for research on text categorization requires a detailed understanding of the real world 
constraints under which the data was produced. Drawing on interviews with Reuters 
personnel and access to Reuters documentation, we describe the coding policy and quality 
control procedures used in producing the RCV1 data, the inten ... 
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summarization and browsing search results 

Krishna Kummamuru, Rohit Lotlikar, Shourya Roy, Karan Singal, Raghu Krishnapuram 
May 2004 Proceedings f the 13th internati nal conference on World Wide Web 

Full text available: ^Mf'MSJMJiB) Additional Information: .full .citation, abstract., references, index tern^s 
Organizing Web search results into a hierarchy of topics and sub-topics facilitates browsing 
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the collection and locating results of interest. In this paper, we propose a new hierarchical 
monothetic clustering algorithm to build a topic hierarchy for a collection of search results 
retrieved in response to a query. At every level of the hierarchy, the new algorithm 
progressively identifies topics in a way that maximizes the coverage while maintaining 
distinctiveness of the topics. We refer the pro ... 

Keywords: automatic taxonomy generation, clustering, data mining, search, summarization 



4 SMiar^ic.d^a,mQde{s 
Joan Peckham, Fred Maryanski 

September 1988 ACM Computing Surveys (CSUR), Volume 20 issue 3 

Full text available: fa^ff3.iQ m) Additional ,nformation: MLcMifin, sbstract, referees, citings, Index 

Semantic data models have emerged from a requirement for more expressive conceptual 
data models. Current generation data models lack direct support for relationships, data 
abstraction, inheritance, constraints, unstructured objects, and the dynamic properties of an 
application. Although the need for data models with richer semantics is widely recognized, 
no single approach has won general acceptance. This paper describes the generic properties 
of semantic data models and presents a represen ... 
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Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: ^ GdM.2|.MB) Additional Information: fuHcltatjon, abstract, references, index te.crr-s 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

TeM..caie3or]zatlon 
readable dictionary 

Elizabeth D. Liddy, Woojin Paik, Edmund S. Yu 

July 1994 ACM Transactions on Information Systems (TOIS), volume 12 issue 3 

Full text available: « cd%U7 m\ Additional ,nformation: Motion, abstract, references, citings, index 

* terms , review 

The text categorization module described here provides a front-end filtering function for the 
larger DR-LINK text retrieval system [Liddy and Myaeing 1993], The model evaluates a 
large incoming stream of documents to determine which documents are sufficiently similar 
to a profile at the broad subject level to warrant more refined representation and matching. 
To accomplish this task, each substantive word in a text is first categorized using a feature 
set based on the semantic Subject Field ... 

Keywords: semantic vectors, subject field coding 



7 Data clustering: a review 

A. K. Jain, M. N. Murty, P. J. Flynn 



September 1999 ACM Computing Surveys (CSUR), volume 31 issue 3 
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Full text available: ^i>df(636,2i.KB) Additional Information: Ml citation, abstract, references, citings, index 

tetn?^, review 

Clustering is the unsupervised classification of patterns (observations, data items, or feature 
vectors) into groups (clusters). The clustering problem has been addressed in many 
contexts and by researchers in many disciplines; this reflects its broad appeal and 
usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult 
problem combinatorially, and differences in assumptions and contexts in different 
communities has made the transfer of useful generic co ... 

Keywords: cluster analysis, clustering applications, exploratory data analysis, incremental 
clustering, similarity indices, unsupervised learning 



8 A compaMlw 

Bill B. Wang, R. I. Bob Mckay, Hussein A. Abbass, Michael Barlow 

February 2003 Proceedings of the twenty-sixth Australasian computer science 

conference on Conference in research and practice in information 

technology - Volume 16 
Full text available: ^gdfQI .9. 73.KB. 5 Additional Information: MLQMion, ^irad, lefeMlc&s, index.terQ is 

We introduced a novel method employing a hierarchical domain ontology structure to 
extract features representing documents in our previous publication (Wang 2002). All raw 
words in the training documents are mapped to concepts in a concept hierarchy derived 
from the domain ontology. Based on these concepts, a concept hierarchy is established for 
the training document space, using is-a relationships defined in the domain ontology. An 
optimum concept set may be obtained by searching the concept hi ... 

Keywords: x 2 statistics, KNN algorithm, concept hierarchy, information gain, ontology, 
principal component analysis, text classification 



9 Sp.0ciaj.jssue.on 

Inderjit S. Dhillon, Subramanyam Mallela, Rahul Kumar 

March 2003 The Journal of Machine Learning Research, volume 3 

Full text available: ^ ad fn 7 1.07 KB:- Additional Information: Mi citation, absirad. cities, index terms 

High dimensionality of text can be a deterrent in applying complex learners such as Support 
Vector Machines to the task of text classification. Feature clustering is a powerful alternative 
to feature selection for reducing the dimensionality of text data. In this paper we propose a 
new information-theoretic divisive algorithm for feature/word clustering and apply it to text 
classification. Existing techniques for such "distributional clustering" of words are 
agglomerative in nature and result in ... ' 



1 0 Ibreg^dimensional .obj.ert. recogn jt jon 
Paul J. Besl, Ramesh C. Jain 

March 1985 ACM Computing Surveys (CSUR), volume 17 issue l 

Additional Information: fall cilation. abstract references, citings , inck-x 



Full text available: "fllpd ft / . / 6 MB) 

^ " terms, review 

A general-purpose computer vision system must be capable of recognizing three- 
dimensional (3-D) objects. This paper proposes a precise definition of the 3-D object 
recognition problem, discusses basic concepts associated with this problem, and reviews the 
relevant literature. Because range images (or depth maps) are often used as sensor input 
instead of intensity images, techniques for obtaining, processing, and characterizing range 
data are also surveyed. 
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11 Jniage.annoMiQn.and.ytdeo 
summaries 

Frank Shipman, Andreas Girgensohn, Lynn Wilcox 

November 2003 Proceedings of the eleventh ACM international conference on 
Multimedia 

Full text available: ^pdfC522.J8.KB.} Additional Information: Ml.PiMion, sbsiract, references, index.ten^s 

In this paper, we describe how a detail-on-demand representation for interactive video is 
used in video summarization. Our approach automatically generates a hypervideo composed 
of multiple video summary levels and navigational links between these summaries and the 
original video. Viewers may interactively select the amount of detail they see, access more 
detailed summaries, and navigate to the source video through the summary. We created a 
representation for interactive video that supports a wi ... 

Keywords: hypervideo, link generation, video editing, video summarization 



1 2 CompuMi^^ 

Paul Suetens, Pascai Fua, Andrew J. Hanson 

March 1992 ACM Computing Surveys (CSUR) ; volume 24 issue l 

Additional Information: lull citation , abstract , references , diinos , tfsdex 



Full text available: Wi odf(6.3/ MB- 

' terms, review 

This article reviews the available methods for automated identification of objects in digital 
images. The techniques are classified into groups according to the nature of the 
computational strategy used. Four classes are proposed: (1) the simplest strategies, which 
work on data appropriate for feature vector classification, (2) methods that match models to 
symbolic data structures for situations involving reliable data and complex models, (3) 
approaches that fit models to the photometry and ... 

Keywords: image understanding, model-based vision, object recognition 



1 3 Text 

Inderjit S. Dhillon, Subramanyam Mallela, Rahul Kumar 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: ^j?dg393..07. KB). Additional Information: fyj]_ citation, isostf £Ct, references, index te^rns 

In this paper we propose a new information-theoretic divisive algorithm for word clustering 
applied to text classification. In previous work, such "distributional clustering" of features 
has been found to achieve improvements over feature selection in terms of classification 
accuracy, especially at lower number of features [2, 28]. However the existing clustering 
techniques are agglomerative in nature and result in (i) sub-optimal word clusters and (ii) 
high computational cost. In order to expli ... 

PIQposals.and.repo 

J. R. Tracey, D. E. Rugh, W. S. Starkey 

August 1999 ACM SIGDOC Asterisk J urnal f Computer Documentation, Volume 23 issue 3 
Full text available: ^g&ft&MMB}. Additional Information: Ml.5M.i9.n, index jejnvs 
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Allison L. Powell, James C. French 

October 2003 ACM Transact! ns n Information Systems (TOIS), volume 21 Issue 4 
Full text available: ^pdf(668 ; 40.KB] Additional Information: MLcjtation, abstract, references, ind^ terra 

The proliferation of online information resources increases the importance of effective and 
efficient information retrieval in a multicollection environment. Multicollection searching is 
cast in three parts: collection selection (also referred to as database selection), query 
processing and results merging. In this work, we focus our attention on the evaluation of the 
first step, collection selection. In this article, we present a detailed discussion of the 
methodology that we used to evaluate an ... 

Keywords: Collection selection, database selection, distributed information retrieval, 
distributed text retrieval, metasearch engine, resource discovery, resource ranking, resource 
selection, server ranking, server selection, text retrieval 
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Rob Kling 

January 1980 ACM Computing Surveys (CSUR), Volume 12 issue 1 

Full text available: ^£df(3 ; MMB). Additional Information: Ml citation, references, citings, ind.extein.is 



1 7 Modetbased. reca 

Roland T. Chin, Charles R. Dyer 

March 1986 ACM Computing Surveys (CSUR), volume 18 issue 1 

Additional Information: full cilaiion , -abstract references, citings , index 



Full text available: 1ilpdF<4.Q4 hAB) 

^ terms, review; 

This paper presents a comparative study and survey of model-based object-recognition 
algorithms for robot vision. The goal of these algorithms is to recognize the identity, 
position, and orientation of randomly oriented industrial parts. In one form this is commonly 
referred to as the "bin-picking" problem, in which the parts to be recognized are presented 
in a jumbled bin. The paper is organized according to 2-D, 2V2-D, and 3-D object 
representations, which are used as the basis for ... 



18 GiQSS: text-source discovery over the internet H 
Luis Gravano, Hector Garda-Molina, Anthony Tomasic 

June 1999 ACM Transactions on Database Systems (TODS), volume 24 issue 2 

Full text available- H odft?30 37 m) Additional lnformation: M.cMofl, extract, minces, citings, Index 
^ ' terms, review 

The dramatic growth of the Internet has created a new problem for users: location of the 
relevant sources of documents. This article presents a framework for (and experimentally 
analyzes a solution to) this problem, which we call the text-source discovery problem. Our 
approach consists of two phases. First, each text source exports its contents to a centralized 
service. Second, users present queries to the service, which returns an ordered list of 
promising text sources. T ... 

Keywords: Internet search and retrieval, digital libraries, distributed information retrieval, 
text databases 



19 Industrial applications: The importance of dealing with uncertainty in the evaluation of 
Gerardo Canfora, Luigi Troiano 
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July 2002 Pr ceedings f the 14th international c nference on S ftware engineering 
and knowledge engineering 

Full text available: ;xif< 336.10 KB) Additional Information: fuli clhation . abstract, refersricfts. index terms 



The correct choice of software tools and methods is a critical success factor to reach and 
maintain market leadership. A mature approach to estimate the impact and risk of 
technology adoption is required. This paper underlines the need for dealing with uncertainty 
to manage correctly the risk of decision-making and proposes a method for evaluating 
software engineering methods and tools. The method, named Software Engineering Fuzzy 
Evaluation Method (SEFEM) is centred on a new class of fuzzy aggr ... 

Keywords: aggregation, decision making, decision support systems, evaluation, fuzzy logic 

20 Experience Using Multiprocessor Systems — A Status Report Q 
Anita K. Jones, Peter Schwarz 

June 1980 ACM Computing Surveys (CSUR), volume 12 issue 2 

Full text available: ^pGf(4 ; 4S.MB). Additional Information: Ml.PMJ.QD.. I&ferences, citings, index tecnos 
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1 Evaluating top-ft queries over web -accessible databases 
Amelie Marian, Nicolas Bruno, Luis Gravano 

June 2004 ACM Transactions on Database Systems (TODS), volume 29 issue 2 

Full text available: ^p:df(.1 : G3.MB). Additional Information: M cjMion, abstract, references, .index .teni£ 

A query to a web search engine usually consists of a list of keywords, to which the search 
engine responds with the best or "top" k pages for the query. This top-/c query model is 
prevalent over multimedia collections in general, but also over plain relational data for 
certain applications. For example, consider a relation with information on available 
restaurants, including their location, price range for one diner, and overall food rating. A 
user who queries such a relation might ... 

Keywords: Parallel query processing, query optimization, top-k query processing, web 
databases. 



2 IechnjcjaJ.sessj Q 
Qi Xue, Aura Ganz 

October 2004 Proceedings of the 12th annual ACM international conference on 
Multimedia 

Full text available: ^.pdft18Q..Q9 KB). Additional Information: fuicitatjon, £bst[£d:, reM^niXis, index teems 

In this paper, we propose Spacing-based Channel Occupancy Regulation (SCORE) MAC 
protocol that provides <i>proportional service differentiation</i> in terms of normalized 
throughput in wireless LANs. As shown by our system model and simulation study, SCORE 
provides consistent, scalable and adjustable proportional differentiation for any network 
size, any service class distribution, any node data rate and any packet size. Compared to 
state-of-the-art prioritized service differentiation ... 

Keywords: channel occupancy, medium access control, multimedia support,. proportional 
service differentiation, wireless LANs 
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Sebastian Elbaum, Gregg Rothermel, Satya Kanduri, Alexey G. Malishevsky 
September 2004 Software Quality Contr I, volume 12 issue 3 
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Full text available: |p p ^ Additional Information: Ml.dMi.on, sbstrsct, index terms 

Regression testing is an expensive testing process used to validate modified software and 
detect whether new faults have been introduced into previously tested code. To reduce the 
cost of regression testing, software testers may prioritize their test cases so that those 
which are more important, by some measure, are run earlier in the regression testing 
process. One goal of prioritization is to increase a test suite's rate of fault detection. 
Previous empirical studies have shown that several ... 

Keywords: empirical studies, regression testing, test case prioritization 



4 Asymmetric 

Preference Ranking 

Aleksander Kotcz, Joshua Alspector 

January 2002 Information Retrieval, volume 5 issue l 

Full text available: f|| Pubi;c;h . >r s . tp Additional Information: MLsMion, abstract, index terms 

In certain classification problems there is a strong a asymmetry between the number of 
labeled examples available for each of the classes involved. In an extreme case, there may 
be a complete lack of labeled data for one of the classes while, at the same time, there are 
adequate labeled examples for the others, accompanied by a large body of unlabeled data. 
Since most classification algorithms require some information about all classes involved, 
label estimation for the un-represented class i ... 

Keywords: imbalanced training data, incomplete data problems, information retrieval, 
personalization, user modeling 



5 Reseai;ch.sessjo 
predicates for top-k queries 

Kevin Chen-Chuan Chang, Seung-won Hwang 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 

Management of data 

f— . • , , .. Ul < . 0v Additional Information: full citation, abstract, reference?: cltinos, index 
Full text available: f £hx-f(1 b v -s MB- * 1 * 

terms 

This paper addresses the problem of evaluating ranked top-k queries with expensive 
predicates. As major DBMSs now all support expensive user-defined predicates for Boolean 
queries, we believe such support for ranked queries will be even more important: First 
ranked queries often need to model user-specific concepts of preference, relevance, or 
similarity, which call for dynamic user-defined functions. Second, middleware systems must 
incorporate external predicates for integrating autonomo ... 

6 Knowledge. aM 

Kathleen R. McKeown, Noemie Elhadad, Vasileios Hatzivassiloglou 

May 2003 Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries 

Full text available: Afidff 116.18 KB) Additjonal lnformatjon: McKayon, ateiml referent citing jnciex 
^ " terms 

Despite the large amount of online medical literature, it can be difficult for clinicians to find 
relevant information at the point of patient care. In this paper, we present techniques to 
personalize the results of search, making use of the online patient record as a sophisticated, 
pre-existing user model. Our work in PERSIVAL, a medical digital library, includes methods 
for re-ranking the results of search to prioritize those that better match the patient record. 
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design decisions: lessons from the field 
Mike Moore, Rick Kazman, Mark Klein, Jai Asundi 

May 2003 Proceedings of the 25th International Conference on Software Engineering 

Full text available: fU odfj[567.08 KB) 

Mi Additional Information: full citation, abstract, references, index terms 

This paper outlines experiences with using economic criteria to make architecture design 
decisions. It briefly describes the CBAM (Cost Benefit Analysis Method) framework applied to 
estimate the value of architectural strategies in a NASA project, the ECS. This paper 
describes the practical difficulties and experiences in applying the method to a large real- 
world system. It concludes with some lessons learned from the experience. 

8 jnduMrial.appllcation 
software^ 

Gerardo Canfora, Luigi Troiano 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering 

Full text available: ^^:4O35J.0„KBj Additional Information: Miration, abstract, i;eMerices, index.fercns 

The correct choice of software tools and methods is a critical success factor to reach and 
maintain market leadership. A mature approach to estimate the impact and risk of 
technology adoption is required. This paper underlines the need for dealing with uncertainty 
to manage correctly the risk of decision-making and proposes a method for evaluating 
software engineering methods and tools. The method, named Software Engineering Fuzzy 
Evaluation Method (SEFEM) is centred on a new class of fuzzy aggr ... 

Keywords: aggregation, decision making, decision support systems, evaluation, fuzzy logic 
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JncorporM^ 

Sebastian Elbaum, Alexey Malishevsky, Gregg Rothermel 

July 2001 Proceedings of the 23rd International Conference on Software Engineering 

Full text available: ffip#f(S6S.24 KB) Additional Information: full citation , abstract , references , stings. Index 
flpubljsherSite tennns 

Test case prioritization techniques schedule test cases. for regression testing in an order that 
increases their ability to meet some performance goal. One performance goal, rate of fault 
detection, measures how quickly faults are detected within the testing process. In previous 
work we provided a metric, APFD, for measuring rate of fault detection, and techniques for 
prioritizing test cases to improve APFD, and reported the results of experiments using those 
techniques. This met ... 

Keywords: fault severity, rate of fault detection, regression testing, test case prioritization, 
test cost 



10 SpecjaLissue.on^ 
categorization 

Ron Bekkerman, Ran El-Yaniv, Naftali Tishby, Yoad Winter 

March 2003 The Journal of Machine Learning Research, volume 3 

Full text available: WpdfC176..53..KB. J Additional Information: Mi .citation, abstract, .index .terms 
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We study an approach to text categorization that combines distributional clustering of words 
and a Support Vector Machine (SVM) classifier. This word-cluster representation is computed 
using the recently introduced Information Bottleneck method, which generates a compact 
and efficient representation of documents. When combined with the classification power of 
the SVM, this method yields high performance in text categorization. This novel combination 
of SVM with word-cluster representation ... 

Wei Van, Chun-hsien Chen, Li Pheng Khoo 

April 2002 Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 

Volume 16 Issue 2 
Additional Information: Mcitation, abstract 

Product definition is widely accepted as one of the key factors to be considered in the early 
stage of product design and development. It has a direct impact on the success of a new 
product in the marketplace. Frequently, product definition is solicited through the voice of 
the customer (VoC). As such, an organization will obtain a prominent competitive edge over 
its competitors if it is able to effectively capture the genuine VoC or the requirements of a 
customer. Sorting techniques provide a m ... 

Keywords: Customer Requirements Acquisition, Fuzzy Evaluation, Knowledge Acquisition, 
Picture Sorts, Repeated Single-Criterion Sorts, Sorting Techniques 



12 Wori<shg£^ 

understanding quality attributes in software architecture structures 
Mikael Svahnberg, Claes Wohlin, Lars Lundberg, Michael Mattsson 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering 

Full text available: ^odf(2M,24 KB). Additional Information: MLeitatipn., abstract, references, jndex te;;rns 

To sustain the qualities of a software system during evolution, and to adapt the quality 
attributes as the requirements evolve, it is necessary to have a clear software architecture 
that is understood by all developers and to which all changes to the system adheres. This 
software architecture can be created beforehand, but must also be updated as the domain of 
the software, and hence the requirements on the software system evolves. Creating an 
architectural structure for a system or part of a sy ... 

Keywords: analytic hierarchy process, architecture structures, quality attributes 



1 3 Extract Ing. predicates fr^ 

Surajit Chaudhuri, Vivek Narasayya, Sunita Sarawagi 

September 2004 ACM Transactions on Database Systems (TODS), volume 29 issue 3 

Full text available: "g.pciggQa^I KB} Additional Information: fe:]j.c!iatiofi 4 abstfiot, Index Xeun$. 

Modern relational database systems are beginning to support ad hoc queries on mining 
models. In this article, we explore novel techniques for optimizing queries that contain 
predicates on the results of application of mining models to relational data. For such queries, 
we use the internal structure of the mining model to automatically derive traditional 
database predicates. We present algorithms for deriving such predicates for a large class of 
popular discrete mining models: decision trees, nai ... 

Keywords: Complex predicate optimization, simpler rules from complex predictive functions 
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14 Planning jn.M^ 

Thomas Bartold, Edmund Durfee 

July 2003 Proceedings f the sec nd international joint conference on Autonomous 
agents and multiagent systems 

Full text available: ^pdf(245.32.KBj Additional Information; Ml citation, abstract, references, index jer;r;s. 

Multiagent systems sometimes undergo changes that cause coordination commitments to 
become insufficient or out of date, such that the coordinated agent plans need to be 
repaired or replaced. When recoordination becomes necessary, disruption to the 
commitments made by the agents in their original plans should be minimized. We approach 
the problem of minimizing disruption by augmenting pre-existing coordination technology by 
developing metrics and automated processes for it to rank and potentially ... 

Keywords: biased search, conflict resolution, coordination of multiple agents, disruption, 
multiagent planning 




1 5 A . su(vey.of Web .metrics 

Devanshu Dhyani, Wee Keong Ng, Sourav S. Bhowmick 

December 2002 ACM Computing Surveys (CSUR), volume 34 issue 4 

Full text available: |^££f(289,2£.KBj Additional Information: Ml .SiMion, ^fesirad, reM^Dces, U i d ex „te ; xd.s 

The unabated growth and increasing significance of the World Wide Web has resulted in a 
flurry of research activity to improve its capacity for serving information more effectively. 
But at the heart of these efforts lie implicit assumptions about "quality" and "usefulness" of 
Web resources and services. This observation points towards measurements and models 
that quantify various attributes of web sites. The science of measuring all aspects of 
information, especially its storage and retrieval or ... 

Keywords: Information theoretic, PageRank, Web graph, Web metrics, Web page similarity, 
quality metrics 



16 R@M£nt. overlay networks 

David Andersen, Hari Balakrishnan, Frans Kaashoek, Robert Morris 

October 2001 ACM SIGOPS Operating Systems Review , Proceedings of the eighteenth 

ACM symposium on Operating systems principles, volume 35 issue 5 

c II i v>+ « ii ui .a <fv* Additional Information: foil citation , abstract , references, citings . Index 
Full text available: TMPofi \ .o0 MEh : ' ' — ' 



1 terms 



A Resilient Overlay Network (RON) is an architecture that allows distributed Internet 
applications to detect and recover from path outages and periods of degraded performance 
within several seconds, improving over today's wide-area routing protocols that take at least 
several minutes to recover. A RON is an application-layer overlay on top of the existing 
Internet routing substrate. The RON nodes monitor the functioning and quality of the 
Internet paths among themselves, and use this information ... 

17 Similar^ 

Surajit Chaudhuri, Kris Ganjam, Venkatesh Ganti, Rajeev Motwani 
June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management fdata 

Full text available: ^.P.df(2Z.I,i7. KB). Additional Information: iv;lj.c|taiipn J abstract, references, jndex ter^s 

To ensure high data quality, data warehouses must validate and cleanse incoming data 
tuples from external sources. In many situations, clean tuples must match acceptable tuples 
in reference tables. For example, product name and description fields in a sales record from 
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a distributor must match the pre-recorded name and description fields in a product 
reference relation. A significant challenge in such a scenario is to implement an efficient and 
accurate fuzzy match operation that can effec ... 

1 8 Efficient 

Jeffrey P. Harrison, M. Nicholas Coppola, Mark Wakefield 
October 2004 Journal of Medical Systems, voiu me 28 Issue 5 

Full text available: || P bjj ^ §it Additional Information: fy]i. dUalion. 3tiM[&& 

This study evaluates the technical efficiency of federal hospitals in the United States using a 
variable returns to scale, input-oriented, data envelopment analysis (DEA) methodology. 
Hospital executives, health care policy-makers, taxpayers, and other stakeholders, benefit 
from studies that improve the efficiency of federal hospitals. Data for 280 federal hospitals 
in 1998 and 245 in 2001 were analyzed using DEA to measure hospital efficiency. Results 
indicate overall efficiency in federal h ... 

Keywords: DEA, data envelopment analysis, federal hospital efficiency, hospital efficiency 



1 9 Exploiting, hierarchy 

Prasanna Ganesan, Hector Garcia-Molina, Jennifer Widom 

January 2003 ACM Transactions on Information Systems (TOIS), volume 21 issue 1 
Full text available: ^jj j pdff2S5.S0 KB? Additional Information: full citation, abstract , references , index terms 

The notion of similarity between objects finds use in many contexts, for example, in search 
engines, collaborative filtering, and clustering. Objects being compared often are modeled 
as sets, with their similarity traditionally determined based on set intersection. Intersection- 
based measures do not accurately capture similarity in certain domains, such as when the 
data is sparse or when there are known relationships between items within sets. We propose 
new measures that exploit a hierarchical ... 

Keywords: Similarity measures, collaborative filtering, data mining, hierarchy 



20 A Survey of Information Retrieval Vendors 
Robert J. Kuhns 

October 1996 Technical Report, Sun Microsystems, Inc. 
Full text available: ^pd£176 J.5.KB • Additional Information: Ml.cjtat.iQn, abstract 

This report is a survey of vendors that develop and market information retrieval technology. 
The objective of this survey is to provide information for those who want an overview of text 
retrieval and document management companies, their products, and their indexing and 
searching capabilities. More specifically, it presents summary information for each 
corporation surveyed regarding its business, markets, products, and technology. Although 
the conclusion contains a discussion of trends, commonal ... 
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Detailed Description 

lexical analyzer 10 10 analyzes semantic alternatives for the search 
term if the determined relevance is below a predetermined threshold, 
e.g., 0 In such cases, lexical analyzer 1010 deten 1 mines the relevance 
score of each synonym of the search term and, adds the relevance score 
, weighted by 1.0, to the previously determined weighted average 
relevance score . If the new relevance score is at least the 
predetermined threshold, relevance analysis stops. Conversely, additional 
synonyms are analyzed in the same manner. 

If all synonyms are exhausted and the cumulative relevance score is 
still below the predetermined threshold, lexical analyzer 10 10 adds 
weighted relevance scores of hyponyms of the search term to the 
cumulative relevance score in the same manner. A hyponyra of a given 
word is a more specific version. . . 
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... In this bar chart, three separate scores are displayed for each 
capability component: A Capability Assessment , A Maximum Applicable 
Weighted Score , and a Maximum Potential Score, which correspond to the 
scores in FIG. 6. The bar. . . 

...5) Maximum Potential 

Weighted Score, and (6) percent of Maximum Achieved. The "percent of 
Maximum Applicable Achieved" measurement is calculated by dividing 
the Assessment 

Weighted Score by the Maximum Applicable Weighted Score . 

Likewise, the 
10 



"percent of Maximum Achieved" is calculated by dividing the Assessment 
Weighted Score... 
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Claims 
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English Abstract 

A method of aggregating and disseminating time sensitive information. A 
data source is searched to identify recently-posted information items 
matching predetermined selection criteria. A respective relevance score 
is calculated for each identified information item. A respective urgency 
rating is determined for each identified information item. Each 



information item is triaged using the urgency rating, and disseminated to 
at least one client based on the triage result. 

French Abstract 

L 1 invention concerne un procede de recueil et de dissemination 
d 1 informations temporaires . On consulte une source de donnees afin 
d 1 identifier les articles d f information postes recemment et qui 
correspondent a des criteres de selection predetermines. On calcule un 
resultat pertinent respectif pour chaque article d 1 information identifie. 
On determine par la suite une evaluation d'urgence respective pour chaque 
article d f information identifie. On trie chaque article d * information a 
1 ' aide de l 1 evaluation d'urgence et on le repand a au moins un client 
d'apres le resultat du triage. 
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Detailed Description 

. . . as just one example, creating a separate "tree" of related terms for 
each category. 

The relevancy weight is preferably provided as a numerical value that 
can be used directly in the computation of the relevancy score (RS) 
of an information item in which the specified term appears. Rather than 
indicating the... 

Claim 

... heterogenous data sources. 

9 A method as claimed in claim 1, wherein the step of calculating a 
respective relevance score for each identified information item 
comprises steps of 

providing a taxonomy of terms including at least a respective relevance 

weighting of each term; and 
processing content of the information item using the taxonomy to derive 
a composite relevance score for the information item. 

10 A method as claimed in claim 9, wherein the step. . . 
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Claims 
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English Abstract 

A non-textual data searching system according to the invention is capable 
of searching non-textual data at semantic levels above the fundamental 
symbolic level. The actual searching process is analogous to a 
conventional text-based search engine: a query vector, which identifies a 
number of fuzzy attributes of the desired data, is processed to retrieve 
and rank a number of keytroids representing clusters of fuzzy attribute 
vectors, where each fuzzy attribute vector represents a data event 
associated with one or more non-textual data points. The keytroids can be 
inverse-mapped to obtain data events and/or non-textual data points that 
satisfy the query. 

French Abstract 

L 1 invention concerne un systeme de recherche de donnees non textuelles 
capable de rechercher ce type de donnees a des niveaux semantiques 
superieurs au niveau symbolique fondamental. L'approche generale consiste 
par commencer a indexer le corpus de donnees non textuelles d'une maniere 
facilitant la recherche. Le processus d' indexation a pour resultat de 
fournir un certain nombre de centroides-cles representant des grappes de 
vecteurs d ! attribut flou, chaque vecteur d f attribut flou representant un 
evenement de donnees associe a un ou plusieurs points de donnees non 
.textuelles. Le processus de recherche est analogue a un moteur de 
recherche classique de donnees textuelles: un vecteur requete, 
identifiant un certain nombre d'attributs flous des donnees recherchees, 
est traite afin d'extraire et d'ordonner un nombre de centroides-cles. 
Les centroides-cles peuvent etre utilises en mode inverse afin d'obtenir 
des evenements de donnees et/ou des points de donnees non textuelles 
satisfaisant la requete. 
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Detailed Description 

... of outgoing links would induce a higher weight. 

Other factors may be included in the relevance weighting , such as the 
number of times a particular page has been visited, or indicators of 
previous relevance judgments by earlier users. More pecuniary search 
engine operators may even increase document relevance weightings in 
return for payment. 
3.3 - User Relevance Feedback. 

[0059] The final function of a search engine is to incorporate relevance 

assessments by the user to refine, and hopefully to improve, the 
retrieval and ranking of documents... 
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English Abstract 

A non-textual data searching system according to the invention is capable 
of searching non-textual data at semantic levels above the fundamental 
symbolic level. The actual searching process is analogous to a 
conventional text-based search engine: a query vector, which identifies a 
number of fuzzy attributes of the desired data, is processed to retrieve 
and rank a number of keytroids representing clusters of fuzzy attribute 
vectors, where each fuzzy attribute vector represents a data event 
associated with one or more non-textual data points. The keytroids can be 
inverse-mapped to obtain data events and/or non-textual data points that 
satisfy the query. 

French Abstract 

L f invention concerne un systeme de recherche de donnees non textuelles 
capable de rechercher ce type de donnees a des niveaux semantiques 
superieurs au niveau symbolique fondamental. L'approche generale consiste 
par commencer a indexer le corpus de donnees non textuelles d ! une maniere 
facilitant la recherche. Le processus d 1 indexation a pour resultat de 
fournir un certain nombre de centroides-cles representant des grappes de 
vecteurs d'attribut flou, chaque vecteur d'attribut flou representant un 
evenement de donnees associe a un ou plusieurs points de donnees non 
textuelles. Le processus de recherche est analogue a un moteur de 
recherche classique de donnees textuelles: un vecteur requete, 
identifiant un certain nombre d'attributs flous des donnees recherchees, 
est traite afin d'extraire et d'ordonner un nombre de centroides-cles. 
Les centroides-cles peuvent etre utilises en mode inverse afin d'obtenir 
des evenements de donnees et/ou des points de donnees non textuelles 
satisfaisant la requete. 
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Detailed Description 

... of outgoing links would induce a higher weight. 

Other factors may be included in the relevance weighting , such as the 
number of times a particular page has been visited, or indicators of 
previous relevance judgments by earlier users. More pecuniary search 
engine operators may even increase document relevance weightings in 
return for payment. 

3.3 - User Relevance Feedback. 

[00591 The final function of a search engine is to incorporate relevance 

assessments by the user to refine, and hopefully to improve, the 
retrieval and ranking of documents... 
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Claims 
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English Abstract 

A non-textual data indexing system according to the invention is capable 
of indexing non-textual data at semantic levels above the fundamental 
symbolic level. The general approach begins by indexing the non-textual 
data corpus in such a way as to facilitate searching. The indexing 
process results in a number of "keytroids" that represent clusters of 
fuzzy attribute vectors, where each fuzzy attribute vector represents a 
data event associated with one or more non-textual data points. 

French Abstract 

La presente invention se rapporte a un systeme de recherche de donnees 
non textuelles qui est capable de rechercher des donnees non textuelles a 
des niveaux semantiques situes au-dessus du niveau symbolique 
fondamental. L'approche generale consiste a indexer tout d f abord le 
corpus des donnees non textuelles de maniere a faciliter la recherche. Le ' 
processus d 1 indexation produit un certain nombre de "centroides-cles 11 
("keytroids") qui representent des groupes de vecteurs attributs flous, 
ou chaque vecteur attribut flou represente un evenement de donnees 
associe a un ou plusieurs points de donnees non textuelles. Le veritable 
processus de recherche est analogue a un moteur de recherche textuel 
classique: un vecteur demande, qui identifie un certain nombre 
d 1 attributs flous des donnees souhaitees, est traite pour recuperer et 
classer un certain nombre de centroides-cles. Ces centroides-cles 'peuvent 
faire l'objet d f une mise en correspondance inversee permettant 
l'obtention d'evenements de donnees et/ou de points de donnees non 
textuelles qui satisfont la demande. 
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Detailed Description / 
... of outgoing links would induce a higher weight. 

Other factors may be included in the relevance weighting , such as the 
number of times a particular page has been visited, or indicators, of 
previous relevance judgments by earlier users. More pecuniary search 
engine operators may even increase document relevance weightings in 
return for payment. 

3.3 - User Relevance Feedback. 

[00591 The final function of a search engine is to incorporate relevance 

assessments by the user to refine, and hopefully to improve, the 
retrieval and ranking of documents... 
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English Abstract 

A system includes a real process (101), which can be a portion of a 
service processes, sales marketing, manufacturing processes, and any 
other processes required to support a business. The real process often 
has information that is derived from the process directly or indirectly. 
The real process often has information that is derived from the process 
directly or indirectly. The real process often includes structured and 
unstructured information, which are difficult to filter and/or 
understand. The information is often stored in databases (103), (105), 
(107) and (109) . 

French Abstract 

L 1 invention concerne un precede et un systeme de trait ement d ! information 
ameliores utilises dans 1 ? analyse de la cause fondamentale d ! un probleme. 
Le procede consiste a introduire des donnees structurees d'un premier 
format et des donnees non structurees d f un processus reel d'un service ou 
d'une operation de fabrication, par exemple un centre d ! appels de service 
a la clientele, des systemes d ' information client de marketing ou des 
systemes d 1 information sur des produits destines a la chaine 
d ! approvisionnement . Ledit procede convertit les informations non 
structurees en un second format structure (optionnel) . Certains modes de 
realisation peuvent ne pas comprendre de donnees non structurees. Le 
procede combine les donnees structurees dans le premier format et les 
donnees structurees dans le second format. Le procede stocke alors les 
donnees structurees dans le premier format et les donnees structurees 
dans le second format dans une memoire. Le procede comprend une etape de 
traitement des donnees combinees avec un ou plusieurs processus 
commerciaux (par exemple le cycle de vie d f un client, 1 1 organisation 
d ! une societe ou les types de resolution de problemes) de maniere a 
coupler le processus commercial avec les donnees structurees et non 
structurees. Le procede traite les informations des donnees combinees 
avec un ou plusieurs modeles financiers (par exemple un modele de revenu 
ou un modele de cout) afin de coupler les modeles financiers aux donnees 
structurees et non structurees. Un ou plusieurs facteurs sont identifies 
a partir du processus reel. Ces facteurs comprennent un symptome, un 
indicateur et d'autres descriptions d'une opportunite d' amelioration. Le 
procede determine un ou plusieurs modeles d f agregat couples aux facteurs 
identifies des donnees traitees. Le procede couple un des modeles a une 
valeur economique et affiche le facteur et le modele associe au facteur 
et a la valeur economique. 
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for end-users to quickly identify which indicators may have useful 
predictive value the application computes relevance scores for all 
indicators and highlights potentially important indicators. The 
relevance scores are weighted combination of % Interactions, % 
Sample Deviation, and % Path Deviation. The following calculations are 
perfonned to produce the relevance scores . 

[00911 (% Interaction records )* (Weight 1) + (Absolute value of % Sample 
Deviation) * (Weight 2) + (Absolute value of... 
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English Abstract 

A method for evaluating the cognitive performance of an individual by 
qualitative/quantitative analysis of the performance of a task, 
characterized in that: -The task is a practical problem which may be 
exactly represented both graphically and mathematically and solved both 
graphically and computationally; -The task has a set of solutions with a 
number of elements greater than one, one element being the best possible 
solution; -And wherein the individual provides a graphical solution; 
-Said graphical solution is converted into a numerical solution; -A 
reference numerical solution is computed; -The graphical solution 
converted into the numerical solution is compared with the computed 
reference numerical solution, a difference index being determined between 
the computed reference solution and the graphical solution proposed by 
the individual. The invention also relates to an apparatus for 
implementing said method. 

French Abstract 

L 1 invention concerne un procede permettant d'evaluer la performance 
cognitive d f un individu par le biais d f une analyse 

qualitative/quantitative de 1 ! execution d'une tache caracterisee en ce 
que: cette tache est un probleme pratique pouvant etre represente 
exactement, a la fois graphiquement et mathematiquement , et pouvant etre 
resolu a la fois de maniere graphique et par le calcul; la tache presente 
un ensemble de solutions comportant un certain nombre d f elements 
superieurs a un, un element etant la meilleure solution possible; et 
l 1 individu doit fournir une solution graphique; ladite solution graphique 
est convertie en une solution numerique; une solution numerique de 
reference est calculee; la solution graphique convertie en solution 
numerique est comparee a la solution numerique de reference calculee, 
puis un index de difference est determine entre la solution de reference 
calculee et la solution graphique proposee par l f individu. L ? invention 
concerne egalement un appareil permettant de mettre en oeuvre ce procede. 

Legal Status (Type, - Date, Text) 

Publication 20030501 A2 Without international search report and to be 

republished upon receipt of that report. 

Fulltext Availability: 
Claims 

Claim 

best solution among the existing solutions, the 
comprehensive . evaluation consisting of a combination of 
said evaluation steps, possibly appropriately weighted . 

57 A method as claimed in one or more of the 
preceding claims 1 to... 
? t25/5, k/16, 18, 20-22,24-25, 27-29 
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English Abstract 

In an exemplary embodiment, invention is a method for analyzing 
portfolios of distressed financial assets for the purpose of bidding to 
acquire those assets. The method utilizes a network-based system (10) 
including a server system (12) coupled to a centralized database (20) and 
at least one client system (14). The method comprises of various steps 
from generating cash flow data table from variety of data sources (116) 
to performing sensitivity analysis (358) using Monte Carlo Simulation 
Model (114) to provide different scenarios. The method utilizes a variety 
of assumptions in performing analysis, and exports cash flow projections 
into a pre-determined format to develop financially attractive bids, 
which have strong probability of, expected return on inverstment after 
taking into account a variety of foreseeable risks. 

French Abstract 

L 1 invention concerne une methode permettant 1' analyse de portef euilles 
d'actifs financiers en crise dans le but de faire une off re et d'acquerir 
ces actifs. La methode consiste a utiliser un systeme reseau (10) 
comprenant un systeme serveur (12) couple a une banque de donnees 
centralisee (20) et a au moins un systeme client (14). La methode 
comporte diverses etapes qui vont de la generation d'un tableau de 
donnees du cash-flow a partir d'une diversite de sources de donnees (116) 
a 1' analyse de sensibilite (358) au moyen du Modele de Simulation Monte 
Carlo (114) afin de produire des scenarios differents. La methode 
consiste a utiliser une variete de suppositions afin d'ameliorer 
1 ! analyse, et a exporter des projections du cash-flow dans un format 
predetermine afin de faire des offres f inancierement interessantes , qui 
ont une forte probabilite de gain sur 1 1 investissement apres avoir pris 
en compte un eventail de risques previsibles. 
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... to Claim 10 wherein said step of developing various assumptions 
comprises the step of inputting relevant valuation information to 
evaluate a portfolio of assets. 12. A method according to Claim 10 
wherein said step of. . . 
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English Abstract 



A customer self service system and method for performing resource search 
and selection. The method includes steps of providing an interface (12) 
enabling entry of a query (131) for a resource and specification of one 
or more user context elements (132) , each element (132) representing a 
context associated with the current user state and having context 
attributes (14) and attribute values (232) associated therewith; enabling 
user specification of relevant resource selection criteria (245) for 
enabling expression of relevance of resource results in terms of user 
context; searching a resource database and generating a resource response 
set having resources that best match a user's query (131), user context 
attributes (14) and user defined relevant resource selection criteria 
(245); presenting said resource response set (332, 333, 335, 336) to the 
user in a manner whereby a relevance of each of the resources being 
expressed in terms of user context in a manner optimised to facilitate 
resource selection; and, enabling continued user selection and 
modification (135, 136) of context attribute values (232) to enable 
increased specificity and accuracy of a user's query (131) to thereby 
result in improved selection logic and attainment of resource response 
sets best fitted to the query. 

French Abstract 

L 1 invention concerne un systeme de libre-service pour la clientele et un 
procede de recherche et de selection de ressources. Le procede consiste a 
mettre en oeuvre une interface (12) qui permet de saisir une requete 
(131) pour une ressource et de specifier un ou plusieurs elements de 
contexte utilisateur (132), chaque element (132) representant un contexte 
associe a l f etat ponctuel de 1 ' utilisateur et comprenant des attributs 
contextuels (14) et des valeurs d f attribut (232) qui leurs sont 
associees. Le procede consiste ensuite a accepter la definition par 
1 1 utilisateur de criteres pertinents de selection de ressources (245) 
permettant d'exprimer la pertinence des resultats se rapportant aux 
ressources en relation avec le contexte utilisateur. Le procede consiste 
aussi a interroger une base de donnees de ressources et a produire un 
ensemble de reponses sur les ressources qui contient les ressources 
. correspondant le mieux a une requete de 1 1 utilisateur (131), des 

attributs contextuels (14) et des criteres pertinents de selection de 
ressources (245) definis par 1 1 utilisateur . Le procede consiste en outre 
a presenter cet ensemble de reponses sur les ressources (332, 333, 335, 
336) a 1 ' utilisateur de facon telle que le caractere pertinent de chaque 
ressource soit exprimee en relation avec le contexte utilisateur, et 
optimisee pour faciliter le choix de ressources. Le procede consiste 
enfin a permettre la selection et la modification (135, 136) continues de 
valeurs d'attribut contextuels (232) afin d'assurer a la requete de 
1 1 utilisateur (131) une specif icite et une precision accrues, ce qui 
donne une logique de selection amelioree et permet la realisation 
d' ensembles de reponses sur les ressources correspondant le mieux a la 
requete . 
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Detailed Description 



inclusionary 

and exclusionary resource filters, and specification of resource 
priorities including the selection, sequencing and weighting of 
relevant 

resource evaluation criteria; 

an intuitive GUI for a customer self service system for resource 
search and selection. . . 
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English Abstract 

A document retrieval system comprising a user interface and processing 
means, wherein the user interface is configured to allow a user to enter 
a query phrase indicative of a subject of interest, and the processing 
means is operative to select query keywords from the query phrase and 
allocate weightings to the query keywords dependent upon the relative 
positions of the query keywords within the query phrase. 
French Abstract 

L f invention concerne un systeme de recherche documentaire comprenant une 
interface utilisateur et un moyen de traitement. L ' interface utilisateur 
est configuree pour permettre a un utilisateur d'entrer une phrase de 
requete indicative d f un sujet d'interet. Le moyen de traitement est concu 
pour selectionner des mots-cles de requete a partir de la phrase de 




requete et affecter des ponderations auxdits mots-cles, en fonction des 
positions relatives de ces derniers dans la phrase de requete. 
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number of relevant documents the keyword appears in, Weightin do, is 
the key-word! s weight within a relevant document and Doc Relevance 
is the relevance rating assigned to the document by the user. This 
algorithm calculates the overall relevance of a particular recurring 
keyword based upon the relevance rating assigned to the document in 
which it occurs. Thus if it occurs in many relevant documents, its mean 
weight will be high. 

The gathering of new keywords following a search may be extended to... 
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English Abstract 

The information retrieval system in accordance with the principles of the 
present invention assigns a relevance rating to each of the index entries 
without requiring an explicit input from the user. When the user selects 
and retrieves an informational item through a list of index entries 
presented as a result of a search, the relevance rating of the selected 
informational item is increased by a predetermined amount, and is further 
adjusted based on any actions the user takes subsequent to the initial 
selection. Ratings of the informational items in the database are 
determined from implicit suggestions from the usage of the retrieval 
system rather than from an explicit user input. In another aspect of the 
present invention, the ratings are allowed to decay over time to minimize 
the tendencies for historical usage biased rating, and to provide more 
temporally accurate ratings. 

French Abstract 

Le systeme d ! extraction d 1 informations selon les principes de la presente 

invention attribue une evaluation de pertinence a chaque entree d 1 index 

sans necessiter une entree explicite de 1 1 utilisateur . Quand 

1 1 utilisateur selectionne et extrait un objet d 1 informations dans une 

liste d 1 entrees d f index presentee en tant que resultat de la recherche, 

l 1 evaluation de la pertinence de 1' objet d 1 informations selectionne est 

revue a la hausse par une valeur predeterminee et est encore adaptee en 

fonction des actions effectuees par 1 1 utilisateur apres la selection 

initiale. Des evaluations des objets d 1 informations dans la base de 

donnees sont determinees a partir des suggestions implicites de 

1 1 utilisation par 1 1 utilisateur du systeme d' extraction plutot qu'a 

partir d 1 entrees utilisateur explicites. Selon un autre aspect de 

1 ! invention, les evaluations peuvent perdre en valeur dans le temps de 

maniere a minimiser les tendances a l 1 evaluation fondees sur un usage 

historique et a presenter des evaluations plus "precise du point de vue 

temporel. 
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FAQ#6: initial value + RATE#3 =0+1=1. 

As can be appreciated, the . inventive relevancy rating system 
described above, determines the relevancyratingof informational items 

, withoutrequiringanexplicitratingbyauserbyprovidinga weighted 
adjustments ofthe ratings based on the usage ofthe informational items by 
the users. The 
1 1 

inventive relevancy rating system can be used to augment 
conventionally known rating systems,. or could supplant conventional, 
explicit . . . 
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English Abstract 

Selection terminals, typically PC computers running internet browsers, 
make search requests to a searching station or search engine. The 
searching station receives search terms and performs a probabilistic 
searching operation. In this way, emphasis is placed upon received terms 
that occur infrequently within source material. Search results, in the 
form of web sites of interest of which the high value search terms occur 
are returned back to the selecting terminal for display. An icon is 
displayed at the selection terminals and search terms are supplied to the 
searching station by highlighting text of interest and then dragging and 
dropping it to the icon. In this way, it is possible for sophisticated 
searching operations to be performed with significantly less effort 
required on the part of the user. In particular, there is no requirement 
for a user to specify Boolean operations. 

French Abstract 

L 1 invention concerne des terminaux de selection, generalement des PC 



utilisant des navigateurs Web, faisant des demandes de recherche a une 
station de recherche ou a un moteur de recherche. Cette station de 
recherche recoit les termes de recherche et effectue une operation de 
recherche probabiliste . Ainsi, 1 1 accent est mis sur les termes recus qui 
apparaissent rarement dans les donnees initiales. Les resultats des 
recherches, se presentant sous forme de sites Web d'interet dans lesquels 
les termes de recherche importants apparaissent, sont renvoyes au 
terminal de selection afin d'etre affiches. Une icone est affichee sur 
les terminaux de selection et les termes de recherche sont selectionnes 
par mise en evidence du texte d'interet et par glissement-deplacement sur 
l 1 icone. Ainsi, les operations de recherche complexes necessitent 
sensiblement moins d f efforts de la part de 1 ' utilisateur . En particulier, 
1 1 utilisateur ne doit pas preciser d'operations booleennes. 
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Claim 

... query term that indexes that document. 

13 A method according to claim 12, wherein said probabilistic 
search compares weighting values so as to rank weighted documents. 

14 A method according to claim 13, wherein said probabilistic 
search identifies documents of interest in response to said ranking. 

15 A method of instructing a probabilistic... 
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English Abstract 

A method and computer program product for comparing documents includes 
segmenting a judgment matrix into a plurality of information sub-matrices 
where each submatrix has a plurality of classifications and a plurality 
of terms relevant to each classification; evaluating a relevance of each 
term of the plurality of terms with respect to each classification of 
each information sub-matrix of the information submatrices; calculating 
an information spectrum for a first document based upon at least some of 
the plurality of terms; calculating an information spectrum for a second 
document based upon at least some of the plurality of terms; and 
identifying the second document as relevant to the first document based 
upon a comparison of the calculated information spectrums. 

French Abstract 

Un procede et un logiciel informatique permettant de comparer des 
documents comprennent la segmentation d ! une matrice de jugement en une 
pluralite de sous-matrices d 1 information de sorte que chaque sous-matrice 
comporte une pluralite de classifications et une pluralite de termes 
correspondant a chaque classification ; 1' evaluation de la pertinence de 
chaque terme faisant partie de la pluralite de termes par rapport a 
chaque classification de chaque sous-matrice d 1 information faisant partie 
des sous-matrices d 1 information ; le calcul d'un spectre d 1 information 
relatif a un premier document sur la base d'au moins un certain nombre 
des termes ; le calcul d'un spectre d 1 information relatif a un deuxieme 
document sur la base d'au moins un certain nombre des termes ; et 
1 1 identification du deuxieme document comme etant pertinent relativement 
au premier document sur la base d'une comparaison des spectres 
d 1 information calcules . 
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Detailed Description 

reordered based on the IS distance between the documents and the 
query. This IS-based relevance ranking can be weighted to allow for 
the inclusion of other relevance measures and to suit the needs of 
administrators of the primary information retrieval system. 
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English Abstract 

An information search and navigation system indexes, categorizes and 
condenses data from text or other documents. In various embodiments, the 
invention further can utilize user information goals, document or website 
types and multi-page link path options that together provide a fast, 
multi-page Web navigation system. 
French Abstract 

L 1 invention concerne un systeme de recherche et d' exploration 

d 1 informations qui indexe, categorise et condense des donnees provenant 

de documents-textes ou d f autres documents. Diverses formes de realisation 



de l 1 invention permettent en outre d'utiliser des objectifs 
d ! informations d 1 utilisateur , des types de documents ou de sites Web et 
des options de circuit de liaison multipages qui forment ensemble un 
systeme d 1 exploration Web rapide multipages. 
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arranged in scoring order, or in the same order they were in the 
original content. 

Appropriate particular weighting scores for different 
characteristics, and appropriate ways of combining weight factors, 
can be specified manually by a system designer or can be determined or 
refined using probability analysis (such as Markov Modeling) ran on 
source content with predetermined weighting scores. In various specific 
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English Abstract 

A collection search system is responsive to a user query against a 
collection of documents to provide a search report. The collection search 
system includes a collection index including first predetermined single 
word and multiple word phrases as indexed terms occurring in the 
collection of documents, a linguistic parser that identifies a list of 
search terms from a user query, the linguistic parser identifying the 
list from second predetermined single words and multiple word phrases, 
and a search engine coupled to receive the list from the linguistic 
parser. The search engine operates to intersect the list with the 
collection index to identify a predetermined document from the collection 
of documents. The search engine includes an accumulator for summing a 
relevancy score for the predetermined document that is related to the 
intersection of the predetermined document with the list., 

French Abstract 

Systeme de recherche dans un ensemble de documents, qui repond a 
1 1 interrogation d'un utilisateur pour fournir un rapport de recherche. 
Ledit systeme comporte un index d 1 ensemble comportant des premiers mots 
uniques et groupes de mots multiples predetermines en tant que termes 
indexes apparaissant dans I 1 ensemble de documents, un analyseur 
linguistique qui identifie une liste de termes de recherche provenant de 
1 1 interrogation d f un utilisateur, ledit analyseur identifiant la liste a 
partir de seconds mots uniques et groupes de mots multiples 
predetermines, et une unite de recherche couplee de maniere a recevoir la 
liste de 1 ! analyseur. V unite de recherche fonctionne par recoupement de 
la liste avec'l 1 index de 1' ensemble pour identifier un document 
predetermine dans l 1 ensemble de documents. Ladite unite comporte un 
accumulateur permettant d ? additionner un score de pertinence pour le 
document predetermine, qui se rapporte au recoupement du document 
predetermine avec la liste. 
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Fulltext Availability: 
Detailed Description 
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particularly relevant and responsive to the client query. 

Various schemes can be utilized to further weight and balance the 

relevancy scores derived from term frequency and terin proximity. 
While a number of such schemes are known, the... 
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ABSTRACT EP 704836 A2 

In a vector quantization apparatus for expressing a target vector by 
using a code vector designated by an index, an error evaluating section 
(2310) performs error evaluation for a code vector without considering 
a code error of the index and error evaluation with considering the code 
error, a first selecting section (2320) selects a small number of 
indexes from a larger number of indexes on the basis of an evaluation 
result without considering the code error, and a second selecting 
section (2325) selects, on the basis of an evaluation result with 
considering the code error, an index used to express the target vector 
from a small number of indexes selected by the first selecting section, 
(see image in original document) 
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...SPECIFICATION index candidates, error evaluation values obtained for 
code vectors corresponding to the index j are weighted with respective 
probabilities . The resultant error evaluation values are summed up. 

When the following arrangement is used as another method of 
calculating . . . 

...SPECIFICATION index candidates, error evaluation values obtained for 
code vectors corresponding to the index j are weighted with respective 
probabilities . The resultant error evaluation values are summed up. 
When the following arrangement is used as another method of calculating 
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A data processing system (20) and method for automatically performing 
prioritized nursing diagnoses from patient assessment data stores a 
diagnosis table (60,70) containing relations (62,72) between nursing 
diagnoses (64,74) and patient characteristics (66,76), with each such 
relation (62,72) having a corresponding probability measure (78). A 
priority table (80) containing at least one relation (82) between a 
diagnosis (84) from a diagnosis table (60,70) and a corresponding 
priority measure (86) is also stored. The data processing system (20) 
receives assessment data, matches assessment data with relations (62,72) 
in the diagnosis table (60,70) and constructs a potential diagnosis list 

(94) including the matched diagnoses and the corresponding probability 
measures. The probability measure (99) of each diagnosis (98) in the 
potential diagnosis list (94) is weighted with the priority measure (86) 
corresponding to the diagnosis (98) from the priority table (80) . When 
more than one relation (62,72) is provided for a diagnosis (64,74) in the 
diagnosis table (60,70), probability measures (78) are combined when the 
potential diagnosis list (94) is constructed. The probability measures 

(99) are preferably added together to combine them, and are preferably 
limited to a maximum value, for example, equivalent to 99%. The diagnosis 
table (60,70) may be divided into primary (60) and secondary diagnoses 

(70) tables. Each relation (62) in the primary diagnosis table (60) has 
the same probability measure. The tables and lists of the data processing 
system (20) and method are preferably implemented in a relational 
database, (see image in original document) 
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Detailed Description 

Claims 

Fulltext Word Count: 13839 
English Abstract 

A data integration system and method collects and stores customer 
information from disparate information sources in real-time. The stored 
information can be retrieved and assembled for presentation to a user 
according to the role and/or security profile of the user. The customer 
information is presented through a 360 (deg) viewer, which is embedded in 
an application or a browser, or runs as a standalone viewer. A data 
correlation system and method correlates data records collected from 
disparate Customer Relationship Management (CRM) applications using a 
three-tier approach comprising deterministic correlation, heuristic 
correlation and historical correlation. The correlation rules are fully 
configurable and .extensible. 

French Abstract 

Un procede et un systeme d 1 integration de donnees collectent et stockent 
en temps reel les renseignements des clients provenant de sources 
d ! informations disparates. Les renseignements stockes peuvent etre 
extraits et regroupes en vue d'etre presentes a 1 1 utilisateur 
conformement a son role et/ou son profil de securite. Les renseignements 
des clients sont presentes sur un visualiseur a 360 (deg), integre dans 
une application ou un navigateur, ou qui tourne de maniere autonome. Un 
procede et un dispositif de correlation de donnees mettent en correlation 
des fiches de renseignements glanees dans des applications de gestion des 
relations avec la clientele (CRM) au moyen d f une approche tripartite 
comprenant une correlation deterministique, une correlation heuristique 
et une correlation historique. Les regies de correlation sont totalement 
conf igurables et extensibles. 

Legal Status (Type, Date, Text) 

.Publication 20041209 A2 Without international search report and to be 

republished upon receipt of that report. 

Fulltext Availability: 
Detailed Description 

Detailed Description 

... to perform correlation processing, including but not limited to, 
comparison of fields between database records, determining weighted 
probability scores for the field comparisons and thresholding the 
probability scores with thresholds stored in the Metadata Database 16. 

[00441 The Customer Index Database 14 is... 
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Publication Language: English 
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Fulltext Availability: 

Detailed Description 

Claims 

Fulltext Word Count: 13553 
English Abstract 

Described herein is a fault diagnosis system for diagnosing faults in 
complex equipment. The system includes means for storing a set of 
diagnostic signatures which relates a set of known faults which may occur 
in the equipment to respective fault symptoms which are deemed indicative 
of said known fauts; and means for processing the diagnostic signatures 
and a set of fault symptoms identified for a current state of the 
equipment to calculate diagnostic data for identifying a fault causing 
the current state of the equipment. The diagnostic data includes a 
plurality of values which are indicative of different relative non^zero 
likelihoods of each of a plurality of different known faults causing the 
current state of the equipment. 

French Abstract 

L 1 invention concerne un systeme de diagnostic de pannes permettant de 
diagnostiquer des pannes dans un equipement complexe. Le systeme comprend 
des moyens de stockage d'un ensemble de signatures de diagnostic mettant 
en rapport un ensemble de pannes connues susceptibles de survenir dans 
1 ? equipement avec des symptomes de panne respectifs reputes indiquer 
lesdites pannes connues; ainsi que des moyens de traitement des 
signatures de diagnostic et d f un ensemble de symptomes de pannes 
identifies pour un etat de courant de 1 1 equipement de maniere a calculer 
des donnees diagnostics destinees a identifier une panne causant l'etat 
de courant de 1 ' equipement . Des donnees diagnostics comportent une 
pluralite de valeurs indiquant differents probabilites relatives non 



nulles de chaque pluralite de differentes pannes connues causant 1 1 etat 
de courant de 1 1 equipement . 

Legal Status (Type, Date, Text) 

Publication 20040129 A2 Without international search report and to be 

republished upon receipt of that report. 

Search Rpt 20040513 Late publication of international search report 

Republication 20040513 A3 With international search report. 

Republication 20040513 A3 Before the expiration of the time limit for 

amending the claims and to be republished in the 
event of the receipt of amendments. 

Fulltext Availability: 
Detailed Description 

Detailed Description 

faults, a better understanding of the likelihood of a fault is 
obtained by generating weighted probabilities to each of the possible 
faults. Step 106 illustrates a pplying a weight score vector 
to generate a Probability Vector. The fault diagnosis system 
calculates 

probabilities by weighting each score by a monotonic function, and 
normalising, so the total probabilities sum to unity. Preferably, an 
exponential weighting function is used. This provides a significantly 
more ... by formula . 

The vector of scores is taken, and then converted to a vector of 

probabilities , using the weighting function and normalisation as 
before . 

The probability calculation is illustrated in the following table. 

Score s Perceived probability e 20s Normalised 

probability 
0.1 7.4 0.00003 
0.6 162754.8 0.73103 
0.55 59874. . . 
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Detailed Description 
Claims 
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English Abstract 

Methods of creating libraries of biological polymers are provided. The 
construction of a library employs a probability matrix for a reference 
sequence, and a constraint vector for which is applied to the probability 
matirx to produce a substitution scheme. Thesubstitution scheme is then 
used to generate a library comprisingsubstitutions recommended by the 
substitution scheme. The librarymembers , or host cells comprising and/or 
expressing them, can be screened for desired changes in a property of 
interest in the biological polymers .in the library. 

French Abstract 

L 1 invention concerne des procedes permettant de creer des bibliotheques 
de polymeres biologiques. Le procede de construction de bibliotheque 
consiste a utiliser une matrice de probabilite comme sequence de 
reference, et a appliquer un vecteur contrainte a ladite matrice de 
probabilite afin d'obtenir un mecanisme de substitution. Ce mecanisme est 
ensuite utilise pour produire une bibliotheque comprenant des 
substitutions recommandees par ledit mecanisme de substitution. Des 
elements de bibliotheque ou des cellules hotes les comprenant et/ou les 
exprimant peuvent etre cribles pour rechercher les modifications desirees 
d ! une propriete consideree dans les polymeres biologiques de cette 
bibliotheque . 

Legal Status (Type, Date, Text) 

Publication 20020418 Al With international search report. 

Publication 20020418 Al Before the expiration of the time limit for 
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event of the receipt of amendments. 

Examination 20021121 Request for preliminary examination prior to end of 

19th month from priority date 

Fulltext Availability: 
Detailed Description 

Detailed Description 

synthetic methods, and combinations thereof 
A profile can be created from the matrix based on probability scores 
and 



> 



weighting factors. The probability matrix for a protein is preferably 
an n x 20 matrix that calculates the probability for any point 
mutation of the target gene that the mutation will result in a. . . 
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Modelling, in real time, hydrodynamic behavior of multi-phase fluid flow 
in transitory phase in pipe, comprises series of neuron networks 

Patent Assignee: INST FRANCAIS DU PETROLE (INSF ) 

Inventor: HENRIOT V; REY F I; TRAN Q H; REY-FABRET I; TRAN Q 



Number of Countries: 005 Number of Patents: 002 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

FR 2848320 Al 20040611 FR 200215570 A 20021210 200443 B 

WO 200463983 A2 20040729 WO 2003FR3583 A 20031203 200451 

Priority Applications (No Type Date) : FR 200215570 A 20021210 
Patent Details: 
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FR 2848320 Al 15 G06N-003/02 
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Designated States (National) : BR GB NO US 

Abstract (Basic) : FR 2848320 Al 

NOVELTY - A number of neuron networks (Estra, Edisp, Eint) are 
constructed, each dedicated to different fluid flow regimes. A 
probability neuron network (RNProba) is constructed to evaluate at all 
times the probability that the flow in the pipe corresponds to each of 
the flow regimes, and the results from the different neuron networks 
are combined weighted by the different probabilities . 

DETAILED DESCRIPTION - The process takes into account the operating 
conditions fixed over a certain number of structural parameters defined 
relative to the pipe, and an assembly of defined physical dimensions, 
with fixed ranges of variation for the said parameters and dimensions, 
using neuron networks with inputs for the parameters and dimensions and 
outputs producing the results required to estimate the hydrodynamic 
behavior, and at least one intermediate layer. The neuron networks are 
determined iteratively to adjust themselves from starter base values 
with predefined tables connecting different values obtained for the 
output data to corresponding values of input data. At least three 
neuron networks are constructed, dedicated respectively to stratified, 
dispersed and intermediate flow regimes. The probabilities are 
calculated for the fluid flow in the pipe to correspond to each of 
these regimes and the results are combined linearly. When the database 
is sufficiently detailed to distinguish sub-regimes inside the same 
flow regime, a neuron network for probability (RN Proba) is constructed 
to evaluate the probabilities of each flow regime at any moment. 

USE - Used for modelling the hydrodynamic behavior of a multi-phase 
flow in a pipe, in particular hydrocarbon flow in oil pipelines to 
avoid slugging. 

ADVANTAGE - Previous models have considered the overall flow rather 
than the different phases present. This method gives a more accurate 
model of flow in the pipe. 

DESCRIPTION OF DRAWING (S) - The figure shows an example of the 
model structure 

Neuron network examining dispersed flow (Edisp) 

Neuron network examining intermediate flow (Eint) 

Neuron network examining stratified flow (Estra) 

Probability of dispersed flow (PDisp) 

Probability of intermediate flow (Pint) 

Probability of stratified flow (Pstra) 

Neuron network for probability (RNProba) 

Overall hydrodynamic behavior (S) 

Hydrodynamic behavior in dispersed flow (Sdisp) 

Hydrodynamic behavior in intermediate flow (SInt) 

Hydrodynamic behavior in stratified flow (SStra) 
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Abstract (Basic) : HU 9903980 Al 

NOVELTY - A 1 'Gyori 11 type fl Z ,f index predicts flour quality 
obtainable from samples of winter-wheat. The "Z 1 1 index is arrived at 
by evaluation and suitable weighting of eleven quality 
characteristics. The values of these characteristics may be converted 
into points, and their sum-total becomes the ' 'Z ?l index. Using a 
conversion table it becomes possible to determine indirectly (in a 
complex manner) the flour quality of fundamental improving grades, as 
well as of commercially solid consignments. Data obtd. as a result of 
the characterisation process may be demonstrated with advantage on 
polygonal diagrams. 

USE - Establishing a ,! Gyori z index 1 1 for the determination of 
quality of flour obtainable from winter-wheat. 

pp; 1 DwgNo 0/0 
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On-line query supporting method for e-com in Internet, involves mapping 
terms in super category to documents category and weighting terms 
in received query to rank and select relevant super category term 
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Abstract (Basic) : WO 200058863 Al 

NOVELTY - A list of super category terms that are linked to 
specific application is prepared based on the category of documents 
to be searched and the listed terms are mapped against document 
category . The category is retrieved based on terms in user input 
query. The query is then modified and terms in the query are weighted 
to determine most relevant super category term by ranking method. 

DETAILED DESCRIPTION - The weighting of the modified query is 
performed by computing sum of term frequency and inverse document 
frequency of each term in the super category terms list. The inverse 
document frequency is set as high value , when terms appearing in the 
category is manually mapped against the super category, when compared 
to the terms that are automatically mapped. INDEPENDENT CLAIMS are also 
included for the following: 

(a) computer program for ranking super categories used for data 
query; 

(b) program for searching document; 

(c) program for establishing super category terms list' 

USE - For displaying on-line banner advertisements for user query 
for e-com in Internet. 

ADVANTAGE - The user's query can be cached and subset or superset 
of cached data can be referred for subsequent queries which enhances 
the response for subsequent user queries. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram. of 
software links of on-line query tool, 
pp; 186 DwgNo 4/71 
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Hypertext database traversing method for retrieving an electronic 
document in the hypertext database of a computer system 

Patent Assignee: C/NET INC (CNET-N) 

Inventor: YU T 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6067552 A 20000523 US 95517136 A 19950821 200049 B 

US 9852050 A 19980330 

Priority Applications (No Type Date) : US 9852050 A 19980330; US 95517136 A 

19950821 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 6067552 A 23 G06F-017/30 CIP of application US 95517136 

Abstract (Basic) : US 6067552 A 

NOVELTY - A list of electronic documents (108a-108c) is produced 
based on the result of comparison between a set of relevant index term 
values and sets of descriptive index terms (212a-212h) . The list is 
ranked with reference to the relevancy of each document with respect to 
a user, based on the weighted relevancy ranking . Each document in 
the list contains at least one relevant index term value. 

DETAILED DESCRIPTION - A weighted relevancy ranking of each 
descriptive index term (212a-212h) and a set of relevant index term 
values are received. The set of relevant index term values are compared 
with the sets of descriptive index terms using the weighted 
relevancy ranking . An INDEPENDENT CLAIM is also included for a 
computer-readable medium. 

USE - For retrieving an electronic document in the hypertext 
database of a computer system. 

ADVANTAGE - Reduces set of relevant links, and enables 
incorporation of expert knowledge of past relevance to determine 
present relevance of documents. Reduces elapsed user time for 
traversing the database. Allows user to control trade-off between 
complexity and number of intermediate links to the relevant documents. 

DESCRIPTION OF DRAWING (S) - The figure shows the diagram of a 
hypertext document database in which the documents are tagged with 
index terms . 

Documents (208a-208c) 

Descriptive index terms (212a-212h) 

pp; 23 DwgNo 2/6 

Title Terms: DATABASE; TRAVERSE; METHOD; RETRIEVAL; ELECTRONIC; DOCUMENT; 

DATABASE; COMPUTER; SYSTEM 
Derwent Class: T01 

International Patent Class (Main) : G06F-017/30 
File Segment: EPI 

Manual Codes (EPI/S-X) : T01-J05B1; T01-J05B4P; T01-J11C1 



20/9/14 (Item 14 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 



(c) 2004 Thomson Derwent . All rts. reserv. 

011564637 **Image available** 

WPI Acc No: 1997-541118/199750 

XRPX Acc No: N97-450454 

Voice recogniser for linguistic information - recognises input voice 
signal by evaluating its acoustic and linguistic probabilities and 
performs integrated evaluation by adjusting weighting factor attached 
to it 

Patent Assignee: FUJI XEROX CO LTD (XERF ) 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 9258786 A 19971003 JP 9664980 A 19960321 199750 B 

Priority Applications (No Type Date): JP 9664980 A 19960321 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 9258786 A 10 G10L-005/06 

Abstract (Basic) : JP 9258786 A 

The recogniser has a speech recognition unit (2) which recognises 
the input voice through a microphone (1)'. The evaluation value of one 
or more acoustic probabilities and the evaluation value of one or 
more linguistic probabilities corresponding to the input voice signal 
are obtained. The recognised candidate is displayed on candidate window 
(7) after attaching weighting factor to both acoustic and linguistic 
signals and an integrated value is obtained. The user chooses the 
correct recognition of voice. 

The exact recognition of the input voice signal is performed by 
adjusting the weighting value of both the acoustic probability and the 
linguistic probability performed by weight coefficient calculation 
part (11), which gives exact evaluation of voice with a higher degree 
of accuracy than that of mistaken recognition result. 

ADVANTAGE - Eliminates need for pretraining with large number of 
utterances. Enables elimination of free adjustment of lot of data for 
voice recognition. Enables good recognition capability. Enables good 
optimisation of voice signal for recognition at higher rate. 
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Inference appts . for cause of mechanical failure - calculates 
probability of cause using data on corrected degree of cause relevance 
and comparison of predicted and actual failure rate 
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Priority Applications (No Type Date): JP 93165653 A 19930705; JP 93165652 A 
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Abstract (Basic) : WO 9502216 A 

The appts for inferring the causes of a failure of e.g a driving 
machine has data on the present abnormality are first input. When a 
true cause is found the data on the degree of relevance of that cause 
is corrected, raising its relevance while the relevance of other causes 
is lowered. Using failure rate data and data on the degree of 
relevance, the probability of each cause is calculated. 

Predicted failure rate and actual failure rate data are then 
compared and the probability level of causes having fewer errors is 
raised, giving a second inference of probability. The two levels of 
probability are weighted , and a final probability is inferred 
based on the weighted inferred probabilities . 

USE/ADVANTAGE - Simple and reliable method of inferring cause of 
failure. Esp. for use with construction machinery, plant, etc. 
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Developing fuzzy controller - involves determining and weighting 
probability values using learning, evaluation and synthesis phases. 
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Abstract (Basic) : DE 4315948 A 

A fuzzifying arrangement determines the probability values for 
linguistic values of input parameters. A control mechanism determines 
linguistic output values for combinations of linguistic values of the 
input parameters. A defuzzifying arrangement determines the output 
values from the probabilities of the output parameter linguistic 
values . 

Value combinations of input and output parameters (el,e2,y) are 
measured in a learning phase. In an evaluation phase , dominant 
probabilities of linguistic values (K,G) are identified in value 
sub-regions and register contents are incremented accordingly. Rule 
synthesis involves selecting linguistic values with the largest 
register contents as satisfying the rule. 

USE/ADVANTAGE - For developing a fuzzy controller. The method 
overcomes some limitations of conventional methods and enables a rule 
base to be set without translation of process knowledge or operator 
experience. 
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Particle judging device esp. for imaging blood flow cytometer - compares 
calculated probability of characteristic parameter of detected 
particle with stored reference characteristic to make judgement 
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Abstract (Basic) : EP 553951 A 

The device detects each particle from a specimen, in which 
different kinds of particles may be intermixed. A detection signal is 
produced representing at least the value of one characteristic of the 
particle. The device also has the ability to store distribution 
characteristics for specific kinds of particle. 

The detection signal is converted in accordance with a 
predetermined rule to calculate the probability of the particle 
belonging to a specific kind. On the basis of the calculated 
probability , the device judges whether the detected particle belongs 
to a specific kind. 

ADVANTAGE - Identifies type of each particle caught by the detector 
with high accuracy in real time and also indicates the probability of 
accuracy of the estimation. 

Dwg.7/7 

Abstract (Equivalent) : US 5469375 A 

A particle judging device comprising: 

means for detecting each particle from a specimen in which plural 
kinds of particles exist intermixed and producing at least two 
detection signals each representing the value of one of at least two 
characteristic parameters of said particle; 

means for storing mean values and distribution widths for each of 
said characteristic parameters representative of various specific kinds 
of particle; 

means for calculating respective deviations of each of said 
detection signals from a corresponding one of said mean values; 

means for normalizing each respective deviation based upon a 
corresponding one of said distribution widths to obtain respective 
normalized deviations; 

means for storing respective predetermined probability distribution 
functions; 

means for . determining respective probabilities for each of said 
respective normalized deviations based upon a corresponding one of said 
probability distribution functions; 

means for weighting each of said respective probabilities of said 
normalized deviations in accordance with a predetermined rule to 
produce weighted probabilities ; and 

judging means for judging whether said particle belongs to any one 
of said specific kinds or not, on the basis of said weighted 
probabilities . 
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Statistically compensated optimisation method - identifying different 
potential yields obtainable for each cutting option, determining 
relative probability of each yield and determining relative value for 
each cutting option 

Patent Assignee: INOVEC INC (INOV-N) 

Inventor: DELEEUW P H 

Number of Countries: 003 Number of Patents: 003 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

CA 2072039 A 19921227 CA 2072039 A 19920623 199311 B 

US 5262956 A 19931116 US 91721831 A 19910626 199347 

CA 2072039 C 19950411 CA 2072039 A 19920623 199522 



Priority Applications (No Type Date) : US 91721831 A 19910626 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
CA 2072039 A 31 B27B-001/00 

US 5262956 A 15 G06F-015/46 

CA 2072039 C B27B-001/00 

Abstract (Basic) : CA 2072039 A 

The method involves considering different cutting options for 
cutting the piece, and identifying different potential yields 
obtainable for each cutting option. The relative probability of 
obtaining each potential option is determined preferably by averaging 
the potential yields for that option where each potential yield is 
statistically weighted by its relative probability . By comparing 
the relative values, the highest-valued cutting option is identified 
and cutting of the piece is performed in accordance with such option. 

The relative probabilities for the respective possible yields are 
preferably determined by empirically measured statistical attributes 
that characterise the uncertain variables that affect these yields and 
by grouping the uncertain variables into statistically significant 
classes. Each class corresponding to a respective yield, where the 
relative frequency of each class is determined from the underlying 
statistical attributes. Alternatively, the relative probabilities are 

determined by using statistical attributes that are based on a 
statistically expected distribution. 

USE - For optimising yield of useful material cut from a piece of 
raw material . 

Dwg. 11/13 

Abstract (Equivalent) : US 5262956 A 

The optimisation method involves considering different possible 
cutting options for cutting a piece of raw material and identifying 
different possible yields obtainable from each respective cutting 
option. A relative probability of obtaining each yield is determined 
including identifying a set of uncertain quantitative variables 
tangibly affecting them. 

A relative value for each cutting options is determined by 
combining the yields with the corresp. relative probabilities. The 
respective relative values of the respective possible cutting options 
are compared with one another and an optimal one is selected. The piece 



is then cut according to the optimal cutting option. 

USE /ADVANTAGE - computerised log cutting. Calculates highest 
possible yield from log. 
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08-248988 [JP 8248988 A] 
September 27, 1996 (19960927) 
MATSUOKA TATSUO 
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NIPPON TELEGR & TELEPH CORP <NTT> [0004 22] (A Japanese 
Company or Corporation), JP (Japan). 
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— Computer Applications) 

:R108 (INFORMATION PROCESSING — Speech Recognition & 
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ABSTRACT 

PURPOSE: To provide the method through which higher recognition/semantic 
understanding rates are obtained in a recognition processing as a whole by 
obtaining a higher probability recognition result grammatical or semantic 
from plural recognition result candidates obtained by an acoustic process. 



CONSTITUTION: The method is provided with an acoustic processing section 1 
which discriminates the recognition result from the acoustic features of 
inputted voices and a language processing section 3 which discriminates the 
recognition result from the grammatical features of the inputted voices. 
The section 1 outputs plural superior recognition result candidates based 
on acoustic evaluation values. The section 3 receives the candidates with 
their acoustic evaluation values, expresses the probability -ease of 

generation of word chains by a probability model, gives grammatical or 
semantic evaluation values for the candidates received from the section 1, 
generates overall evaluation values through the appropriately weighted 

linear sum of the acoustic evaluation values of the section 1 and the 
grammatical evaluation values and outputs the candidates having a high 
overall evaluation value as the recognition result. 
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ABSTRACT 

PURPOSE: To enable learning and renewal of a partial model even if traffic 
conditions or processing data are fluctuated, by performing relearning of a 
neural net . through a process of taking the actual using state data of a 
group supervisory operation controller,- and by continuing learning 
through a process of lowering the processing level by the specific time 
factor. 



CONSTITUTION: A group supervisory operation controller 1 is connected to 
single control units 2-1 to 2-N of respective elevators and a learning 
control unit 1-1 through a high-speed transmission passage 6, composed of a 
microcomputer, etc., and operated under the supervision of software. 
Input/output control units 4 of hall call buttons 3 installed on respective 
floors are connected to these control units through a low-speed 
transmission passage 1. The learning control unit 1-1 produces the traffic 
conditions and the group supervisory operation control responded result on 
the basis of information from an elevator group system for sending a 
signal through the group supervisory operation control unit 1 for every 
specific time, so as to take it as a data base for the suitable control 
parameter and on-line learning. The learning control unit 1-1 takes the 
control parameter to be set by the fixed period as a suitable evaluation 
index weighting value and transmits it to the group supervisory 

control unit 1 . 
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ABSTRACT 

PURPOSE: To execute an optimum route search by normalizing each term of an 
evaluation function and varying suitably a weighting coefficient of 
each term, even in case a circular arc-shaped route is contained in the 
route. 

CONSTITUTION: A start node, an object node, and continuous intermediate 
nodes are denoted as S, G and Vi, V j , respectively, and an evaluation 
function H(Vi, Vjx) for selecting the node Vj from the node Vi is defined 
by an expression. In this case, l(Vjx, G) , l(Vi, Vjx), and L denote a 
distance between the next candidate node Vjx and the object node G, a 
distance between the present node Vi and the next candidate node Vjx, and a 
distance of a diagonal line of a rectangle circumscribed to a moving area 
of a moving robot, respectively. That is to say, a value A(Vjx, G) is 
normalized by the distance L, and a value B(Vi, Vjx) is normalized by a 
value which has multiplied the distance L by the circular constant .pi.. Wa 
and Wb are coefficients for weighting of a variable A(Vjx, G) and B(Vi, 
Vjx) , respectively. In this way, an optimum route search can be executed 
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26/7/8 (Item 1 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

6851055 INSPEC Abstract Number: B2001-04-6135E-034 , C2001-04-5260B-064 

Title: Segmentation of FLIR images by Hopfield neural network with edge 
constraint 

Author(s): Nong Sang; Tianxu Zhang 

Author Affiliation: Inst. of Pattern Recognition & Artificial 
Intelligence, Huazhong Univ. of Sci. & Technol . , Hubei, China 
Journal: Pattern Recognition vol.34, no. 4 p. 811-21 
Publisher: Elsevier, 

Publication Date: April 2001 Country of Publication: UK 

CODEN: PTNRA8 ISSN: 0031-3203 

SICI: 0031-3203 (200104 ) 34 : 4L. 811 : SFIH; 1-X 

Material Identity Number: P133-2001-003 

U.S. Copyright Clearance Center Code: 0031-3203/2001/$20 . 00 
Document Number: S0031-3203 ( 00 ) 0004 1-8 

Language: English Document Type: Journal Paper (JP) 
Treatment: Applications (A); Practical (P) 

Abstract: A segmentation algorithm of forward-looking infrared (FLIR) 
images by Hopfield neural network (HNN) with edge constraint is presented. 
An evaluation criterion based on distinct edge pixels is used to examine 
the segmentation results by HNN under different initial assignment of 
probabilities . Thus, the good segmentation result can be achieved by 
automatically adapting initial assignment of probabilities to reach the 
optimal or suboptimal solution of the evaluation criterion . To 

determine appropriate weights of the objective function and the 

constraint condition in the energy of HNN, a criterion with respect to 
the constraint condition is proposed. Experimental results with real FLIR 
images are given. (12 Refs) 

Subfile: B C 

Copyright 2001, IEE 



26/7/10 (Item 3 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

6149596 INSPEC Abstract Number: C1999-03-1290-014 
Title: Fuzzy MCDM based on ideal and anti-ideal concepts 

Author (s): Gin-Shuh Liang 

Author Affiliation: Dept. of Shipping & Transp. Manage., Nat. Taiwan 
Ocean Univ., Keelung, Taiwan 

Journal: European Journal of Operational Research vol.112, no. 3 p. 
682-91 

Publisher : Elsevier, 

Publication Date: 1 Feb. 1999 Country of Publication: Netherlands 

CODEN: EJORDT ISSN: 0377-2217 

SICI: 0377-2217 (19990201) 112: 3L. 682: FMBI; 1-G 

Material Identity Number: E272-1999-001 

U.S. Copyright Clearance Center Code: 0377-2217/99/$20 . 00 
Document Number: S0377-2217 ( 97 ) 00410-4 

Language: English Document Type: Journal Paper (JP) 
Treatment: Theoretical (T) 

Abstract: This paper presents a novel fuzzy multicriteria decision making 
(MCDM) based on the concepts of ideal and anti-ideal points. The concepts 
of fuzzy set theory and hierarchical structure analysis are used to 
develop a weighted suitability decision matrix to evaluate the 

weighted suitability of different alternatives versus various criteria. 
The distance of different alternatives versus positive ideal solution and 
negative ideal solution are then obtained by using the proposed ranking 
method. Finally, the relative approximation values of various alternatives 
versus positive ideal solution are ranked to determine the best 
alternative. (18 Refs) 
Subfile: C 
Copyright 1999, IEE 



26/7/11 (Item 4 from file: 2) 

DIALOG (R) File 2: INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

6091606. INSPEC Abstract Number: C9901-6130-001 

Title: Rule-based classification procedures related to the unprecisely 
formulated expert rules 

Author (s): Kurzynski, M.W. ; Sas, J. 

Author Affiliation: Fac. of Electron .,■ Tech . Univ. of Wroclaw, Poland 
Conference Title: Proceedings SIBGRAPI'98. International Symposium on 

Computer Graphics, Image Processing, and Vision (Cat. No.98EX237) p. 

241-5 

Editor(s): da Fontoura Costa, L.; Camara, G. 
Publisher: IEEE Comput. Soc, Los Alamitos, CA, USA 
Publication Date: 1998 Country of Publication: USA xv+486 pp. 
ISBN: 0 8186 9215 4 Material Identity Number: XX98-02927 

U.S. Copyright Clearance Center Code: 0 8186 9215 4/98/$10.00 
Conference Title: Proceedings SIBGRAPI f 98. International Symposium on 
Computer Graphics, Image Processing, and Vision 

Conference Sponsor: Inst. Fisica de Sao Carlos (IFSC-USP) ; Inst. 
Pesquisas Espaciais (INPE); Sociedade Brasileira de Computacao (SBC); 
Fundacao de Amparo a Pesquisa do Estado de Sao Paulo (FAPESP) ; Fundacao de 
Amparo a Pesquisa do Estado do Rio de Janeiro (FAPERJ); Sociedade 
Brasileira de Computacao (SBC) ; Conselho Nacional de Desenvolvimento 
Cientifico e Technol. (CNPq) ; Fundacao Coordenacao de Aperf eioamento de 



Pessoal de Nivel Superior (CAPES) 

Conference Date: 20-23 Oct. 1998 Conference Location: Rio de Janeiro, 
Brazil 

Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) 

Abstract: The paper is devoted to the pattern recognition procedure based 
on the set of expert rules with unprecisely formulated weights understood 
as conditional probabilities . Adopting the probabilistic model, the 
recognition algorithm is derived and evaluation of its probability of 
misclassif ication is given. Furthermore, the case with both expert rules 
and the learning set is considered. The proposed algorithms were 
empirically tested on the computer generated data and in computer aided 
diagnosis of acute renal failure and compared with sample based (k-NN) 
algorithm. (8 Refs) 

Subfile: C 

Copyright 1998, IEE 



26/7/15 (Item 8 from file: 2) 

DIALOG (R) File 2: INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

5109303 INSPEC Abstract Number: C9512-7250R-04 4 
Title: Term- relevance computations and perfect retrieval performance 
Author (s): Shaw, W.M., Jr. 

Author Affiliation: Sch. of Inf. & Libr. Sci., North Carolina Univ., 
Chapel Hill, NC, USA 

Journal: Information Processing & Management vol.31, no. 4 p. 4 91-8 
Publication Date: 1995 Country of Publication: UK 
CODEN: IPMADK ISSN: 0306-4573 

U.S. Copyright Clearance Center Code: 0306-4573/95/$9 . 50+0 . 00 
Language: English Document Type: Journal Paper (JP) 
Treatment: Theoretical (T) 

Abstract: Computing formulas for binary independent (BI) term relevance 
weights are evaluated as a function of query representations and 
retrieval expectations in the CF database. Query representations consist of 
the limited set of terms appearing in each query statement and the complete 
set of terms appearing in the database. Retrieval expectations include 
comprehensive searches, for which many relevant documents are sought, and 
specific searches, for which only a few documents have merit. Conventional 
computing equations, which are known to overestimate term relevance 

weights , are shown to produce mediocre results for all combinations of 
query representations and retrieval expectations. Modified computing 
equations, which do not over estimate relevance weights , produce 
essentially perfect retrieval results for both comprehensive and specific 
searches, when the query representation is complete. Probabilistic 
retrieval, based on BI assumptions and applied to simple subject 
descriptions of documents and queries, can retrieve all relevant documents 
and -only relevant documents, when term relevance weights are computed 
accurately. (14 Refs) 
Subfile: C 
Copyright 1995, IEE 



26/7/16 (Item 9 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

5092587 INSPEC Abstract Number: C9512-724 0-011 

Title: Applying probabilistic term weighting to OCR text in the case 



of a large alphabetic library catalogue 

Author(s): Mittendorf, E. ; Schauble, P.; Sheridan, P. 

Author Affiliation: Swiss Federal Inst, of Technol., Zurich, Switzerland 
Journal: SIGIR Forum Conference Title: SIGIR Forum (USA) spec, issue, 
p. 328-35 

Publication Date: 1995 Country of Publication: USA 
CODEN: FASRDV ISSN: 0163-5840 

Conference Title: 18th International ACM SIGIR Conference on Research and 
Development in Information Retrieval 
Conference Sponsor: ACM 

Conference Date: 9-13 July 1995 Conference Location: Seattle, WA, USA 
Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Practical (P) 

Abstract : We report on a probabilistic weighting approach to indexing 
the scanned images of very short documents. This fully automatic process 
copes with short and very noisy texts (67% word accuracy) derived from the 
images by optical character recognition (OCR) . The probabilistic term 

weighting approach is based on a theoretical proof explaining how the 
retrieval effectiveness is affected by recognition errors. We have 

evaluated our probabilistic weighting approach on a sample of index 
cards from an alphabetic library catalogue where, on the average, a card 
contains only 23 terms. We have~ demonstrated over 30% improvement in 
retrieval effectiveness over a conventional weighted retrieval method where 
the recognition errors are not taken into account. We also show how we can 
take advantage of the ordering information of the alphabetic library 

catalogue . (13 Refs) 
Subfile: C 
Copyright 1995, IEE 



26/7/20 (Item 13 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

02173980 INSPEC Abstract Number: C84006284 
Title: Fuzzy sets and generalized Boolean retrieval systems 
Author (s): Kraft, D.H.; Buell, D.A. 

Author Affiliation: Dept. of Computer Sci . , Louisiana State Univ., Baton 
Rouge, LA, USA 

Journal: International Journal of Man-Machine Studies vol.19, no.l 
p. 45-56 

Publication Date: July 1983 Country of Publication: UK 
CODEN: IJMMBC ISSN: 0020-7373 

U.S. Copyright Clearance Center Code: 0020-7373/83/070045+12$03 . 00/0 
Language: English Document Type: Journal Paper (JP) 
Treatment: Bibliography (B) ; Theoretical (T) 

Abstract: Substantial work is done on the application of fuzzy subset 
theory to information retrieval. Boolean query processing is generalized to 
allow for weights to be attached to individual terms, in either the 
document indexing or the query representation, or both*. Problems with the 
generalized Boolean lattice structure are noted, and an alternative 
approach using query thresholds and appropriate document evaluation 

functions is suggested. Problems remain unsolved, however. Criteria 
generated for the query processing mechanism are inconsistent. The exact 
functional form and appropriate parameters for the query processing 
mechanism must be specified. Moreover, the generalized Boolean query model 
must be reconciled with the vector space approach, suggested for weighted 

retrieval, and probabilistic retrieval models. Finally, proper retrieval 
evaluation mechanisms reflecting the fuzzy nature of retrieval are needed. 



(12 Refs) 

Subfile: C 
? t26/7/22-23, 25, 27,30, 32-33 

26/7/22 (Item 15 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2004 Institution of Electrical Engineers. All rts. reserv. 

01133542 INSPEC Abstract Number: C78000299 

Title: A note of the choice of a weighting function to give an efficient 
method for estimating the probability of misclassif ication 

Author (s): McLachlan, G.J. 

Author Affiliation: Dept. of Math., Univ. of Queensland, St. Lucia, Qld., 
Australia 

Journal: Pattern Recognition vol.9, no. 3 p. 147-9 
Publication Date: Oct. 1977 Country of Publication: UK 
CODEN: PTNRA8 ISSN: 0031-3203 

Language: English Document Type: Journal Paper (JP) 
Treatment: Theoretical (T) 

Abstract: The problem of estimating the performance of a given 
classifier on a given data set is considered. In an attempt to provide 
a nonparametric estimator which not only uses the data efficiently but is 
also essentially an unbiased estimator of the probability of 
misclassif ication, Toussaint evaluated empirically an estimator formed by 
weighting the resubstitution and rotation estimators. In this study 
theoretical consideration is given to the choice of a suitable weighting 
function for this estimator. (13 Refs) 
Subfile: B C 



26/7/23 (Item 1 from file: 8) 

DIALOG (R) File 8:Ei Compendex(R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

05911854 E.I. No: EIP0138560164 6 

Title: Segmentation of FLIR images by Hopfield neural network with edge 
constraint 

Author: Sang, N . ; Zhang, T. 

Corporate Source: Inst. Pattern Recog. Artif. Intel. Huazhong Univ. of 
Sci. and Tech., Wuhan, Hubei 430074, China 

Source: Pattern. Recognition v 34 n 4 April, p 811-821 

Publication Year: 2001 

CODEN: PTNRA8 ISSN : 0031-3203 

Language: English 

Document Type: JA; (Journal Article) Treatment: A; (Applications); T; 
(Theoretical) 

Journal Announcement: 0110W4 

Abstract: A segmentation algorithm of forward-looking infrared (FLIR) 
images by Hopfield neural network (HNN) with edge constraint is presented. 
An evaluation criterion based on distinct edge pixels is used to 
examine the segmentation results by HNN under different initial assignment 
of probabilities . Thus, the good segmentation result can be achieved by 
automatically adapting initial assignment of probabilities to reach the 
optimal or suboptimal solution of the evaluation criterion . To 
determine appropriate weights of the objective function and the 
constraint condition in the energy of HNN, a criterion with respect to 
the constraint condition is proposed. Experimental results with real FLIR 
images are given, copy 2001 Pattern Recognition Society. Published by 
Elsevier Science Ltd. All rights reserved. (Author abstract) 12 Refs. 



26/7/25 (Item 3 from file: 8) 

DIALOG (R) File 8:Ei Compendex (R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

04620959 E.I. No: EIP97023516011 

Title: Quality improvement for RC06 chip resistor 

Author: Jeng, Yann-Chyn; Guo, Shin-Ming 

Corporate Source: Kaohsiung Polytechnic Inst, Ta-hsu Hsiang, Taiwan 
Source: Quality and Reliability Engineering International v 12 n 6 
Nov-Dec 1996. p 439-445 r 
Publication Year: 1996 
CODEN: QREIE5 ISSN: 0748-8017 
Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical); X; 
(Experimental) 

Journal Announcement: 97 04W1 

Abstract: This paper studies a quality improvement case of extremely thin 
and light chip resistor RC06. We use an L//l//8(2**l multiplied by 3**7) 
orthogonal array allocating eight control factors in an experimental plan. 
The quality response data are inevitably considered to be ordered 
categorical . Six categories are classified for the quality of chips. 
Both Taguchi f s accumulation analysis method (1966) and Nair f s scoring 
scheme (1986) are employed in analysing the data. Furthermore, we develop a 
weighted probability scoring scheme (WPSS) and a signal-to-noise (SN) 
ratio to reach an optimal solution. Finally, a comparison among the three 
approaches is made. (Author abstract) 4 Refs. 



26/7/27 (Item 5 from file: 8) 

DIALOG (R) File 8 : Ei Compendex (R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

04337361 E.I. No: EIP96013013278 

Title: Analyzing ordered categorical data from orthogonal array 

Author: Jeng, Yann-Chyn; Guo, Shin-Ming 

Corporate Source: Kaohsiung Polytechnic Inst, Kaohsiung, Taiwan 

Conference Title: Proceedings of the 1995 IEEE International Conference 
on Systems, Man and Cybernetics. Part 3 (of 5) 

Conference Location: Vancouver, BC, Can Conference Date: 
19951022-19951025 

Sponsor: IEEE 

E.I. Conference No.: 44222 

Source: Proceedings of the IEEE International Conference on Systems, Man 
and Cybernetics v 3 1995. IEEE, Piscataway, NJ, USA, 95CB357 67 . p 2828-2833 
Publication Year: 1995 
CODEN: PICYE3 ISSN: 0884-3627 
Language: English 

Document Type: CA; (Conference Article) Treatment: A; (Applications); G 
; (General Review) 

Journal Announcement: 9603W4 

Abstract: This paper studies a quality improvement case of extremely thin 
and light chip resistor RC06. We use an L//l//8(2**l multiplied by 3**7) 
orthogonal array allocating 8 control factors in an experimental plan. The 
quality response data are inevitably considered to be ordered categorical 
. Six categories are classified for the quality of chips. Both 
Taguchi's accumulation analysis method (1966) and Nair's scoring scheme 
(1986) are employed in analyzing the data. Furthermore, we develop a 
weighted probability scoring scheme (WPSS) and a signal to noise (SN) 
ratio to reach an optimal solution. Finally, a comparison among the three 



approaches is made. (Author abstract) 5 Refs. 



26/7/30 (Item 8 from file: 8) 

DIALOG (R) File 8 : Ei Compendex (R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

03463898 E.I. Monthly No: EI9208103639 

Title: Personnel placement in a fuzzy environment. 

Author: Liang, Gin-Shuh; Wang, Mao-Jiun J. 

Corporate Source: Natl Tsing Hua Univ, Hsinchu, Taiwan 

Source: Computers & Operations Research v 19 n 2 Feb 1992 p 107-121 

Publication Year: 1992 

CODEN: CMORAP ISSN: 0305-0548 

Language : English 

Document Type: JA; (Journal Article) Treatment: A; (Applications); T; 
(Theoretical) 

Journal Announcement: 9208 

Abstract: Evaluating personnel suitability is very important for 
decision-makers to select a best candidate under various evaluation 
criteria or to seek an optimal matching between personnel and jobs within 
the constraints dictated by available human resources and jobs. By means of 
aggregating decision-makers 1 fuzzy assessment about criteria weightings 
and personnel suitability ratings relative to various evaluation 
criteria , the fuzzy suitability indices can be obtained. Combining the 
concepts of fuzzy set theory and the weighted complete bipartite graph, a 
polynomial time algorithm for personnel placement under fuzzy environment 
is developed. Using this algorithm, the decision-makers can maximize 
personnel utilization and increase job effectiveness. (Author abstract) 28 
Refs. 



26/7/32 (Item 10 from file: 8) 

DIALOG (R) File 8:Ei Compendex (R) 

(c) 2004 Elsevier Eng. Info. Inc. All rts. reserv. 

01447804 E.I. Monthly No: EIM8310-071324 

Title: CRITERIA WEIGHTING: THE RIGHT TO BE RIGHT. 

Author: Fiorelli, James A. 

Corporate Source: Johnson Controls Inc, Milwaukee, Wis, USA 
Conference Title: SAVE Proceedings, Volume 16, 1981, International 
Conference . 

Conference Location: St. Louis, Mo, USA Conference Date: 19810426 
Sponsor: Soc of American Value Engineers, Irving, Tex, USA 
E.I. Conference No.: 02447 

Source: SAVE Proceedings (Society of American Value Engineers) v 16. Publ 
by Soc of American Value Engineers, Irving, Tex, USA p 105-109 
Publication Year: 1981 
CODEN: SAPRDN 
Language: English 

Document Type: PA; (Conference Paper) 
Journal Announcement: 8310 



26/7/33 (Item 1 from file: 34) 

DIALOG (R) File 34 : SciSearch (R) Cited Ref Sci 
(c) 2004 Inst for Sci Info. All rts. reserv. 

09997887 Genuine Article*: 474GL Number of References: 31 

Title: Probabilistic neural networks for segmentation of features in corn 



kernel images 

Author(s): Steenhoek LW (REPRINT) ; Misra MK; Batchelor WD; Davidson JL 

Corporate Source: Pioneer Hi Bred Int Inc, 6900 NW 62nd/ J6hnston//IA/50131 
(REPRINT); Pioneer Hi Bred Int Inc, Johnston//IA/50131 ; Iowa State 
Univ, Dept Elect & Comp Engn, Ames//IA/ ; Iowa State Univ, Seed Sci 
Ctr, Ames//IA/; Iowa State Univ,Ag & Biosyst Eng Dept , Ames// IA/ 

Journal: APPLIED ENGINEERING IN AGRICULTURE, 2001, V17, N2 (MAR), P225-234 

ISSN: 0883-8542 Publication date: 20010300 

Publisher: AMER SOC AGRICULTURAL ENGINEERS, 2950 NILES RD, ST JOSEPH, MI 

49085-9659 USA 
Language: English Document Type: ARTICLE 

Abstract: A method is presented for clustering of pixel color information 
to segment features within corn kernel images. Features for blue-eye 
mold, germ damage, sound germ, shadow in sound germ, hard starch, and 
soft starch were identified by red, green, and blue (RGB) pixel value 
inputs to a probabilistic neural network. A data grouping method 
to obtain an exemplar set for adjustment of the Probabilistic Neural 
Network (PNN) weights and optimization of a universal smoothing 
factor is described. Of the 14,427 available exemplars (RGB pixel 
values sampled from previously collected images), 778 were used for 
adjustment of the network weights, 737 were used for optimization of 
the PNN smoothing parameter and 12,912 were reserved for network 
validation. Based on a universal PNN smoothing factor of 0.05, the 
network was able to provide an overall pixel classification accuracy 
of 86% on calibration data and 75% on unseen data. Much of the 
misclassif ication was due to overlap of pixel values among classes. 
When an additional network layer was added to combine similar classes 
(blue-eye mold and germ damage, sound germ and shadow in sound germ, 
and hard and soft starch) , network results were significantly enhanced 
so that accuracy* on validation data was 94.7%. Image quality was shown 
to be important to the success of this algorithm as lighting and camera 
depth of field effects caused artifacts in the segmented images. 

? t26/7/38, 41, 44 

26/7/38 (Item 6 from file: 34) 

DIALOG (R) File 34 : SciSearch (R) Cited Ref Sci 
(c) 2004 Inst for Sci Info. All rts. reserv. 

07332682 Genuine Article#: 151 JF Number of References: 20 
Title: Fuzzy MCDM based on ideal and anti-ideal concepts 

Author(s): Liang GS (REPRINT) 

Corporate Source: NATL TAIWAN OCEAN UNIV, DEPT SHIPPING & TRANSPORTAT 

MANAGEMENT /CHI LUNG/ /TAIWAN/ (REPRINT) 
Journal: EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1999, V112, N3 (FEB 1) 
, P682-691 

ISSN: 0377-2217 Publication date: 19990201 • 

Publisher: ELSEVIER SCIENCE BV, PO BOX 211, 1000 AE AMSTERDAM, NETHERLANDS 
Language: English Document Type: ARTICLE 

Abstract: This paper presents a novel fuzzy multiple criteria decision 

making (MCDM) based on the concepts of ideal and anti-ideal points. The 

concepts of fuzzy set theory and hierarchical structure analysis 
are used to develop a weighted suitability decision matrix to 
evaluate the weighted suitability of different alternatives versus 
various criteria. The distance of different alternatives versus 
positive ideal solution and negative ideal solution are then obtained 
by using the proposed ranking method. Finally, the relative 
approximation values of various alternatives versus positive ideal 
solution are ranked to determine the best alternative. (C) 1999 
Elsevier Science B.V. All rights reserved. 



26/7/41 (Item 9 from file: 34) 

DIALOG (R) File 34 : SciSearch ( R) Cited Ref Sci 
(c) 2004 Inst for Sci Info. All rts. reserv. 

06344666 Genuine Article*: YL071 Number of References: 39 

Title: Evaluating the effectiveness of sequence analysis algorithms using 

measures of relevant information 
Author (s): Wootton JC (REPRINT) 

Corporate Source: NATL LIB MED, NATL CTR BIOTECHNOL INFORMAT, NIH, BLDG 38A, 

ROOM 8N805/BETHESDA//MD/20894 (REPRINT) 
Journal: COMPUTERS & CHEMISTRY, 1997, V21, N4, P191-202 
ISSN: 0097-8485 Publication date: 19970000 

Publisher: PERGAMON-ELSEVIER SCIENCE LTD, THE BOULEVARD, LANG FORD LANE, 

KIDLINGTON, OXFORD,' ENGLAND OX5 1GB 
Language: English Document Type: ARTICLE 

Abstract: Given vast quantities of molecular sequence data, and numerous 
different algorithms designed to discover, diagnose or model 
biologically interesting features in sequences, how is it possible to 
make objective evaluations of the diagnostic effectiveness of these 
algorithms and robust assessments of their relative strengths and 
limitations? An approach to this relatively neglected question is 
developed here, which is based on information measures of the 
diagnostic efficiency of different methods. From output lists of a 
procedure such as a database search, ' 1 relevance weights ' 1 are 
assigned that encode, for each sequence listed, the level of associated 
scientific evidence implicating that sequence as an example of a 
feature of interest. Relevance weights may be derived , following 
systematic protocols, from expert human judgement or, in principle, by 
automated information retrieval from electronic resources. Practical 
applications of this approach to algorithm assessment and development 
and parameter choice are demonstrated with examples of automated 
sequence motif modeling for the DNA-binding helix-turn-helix motif and 
the guanine exchange factor protein domain. The combined use of 
relevance weights and information measures appears to offer 
promising advantages over ROC analysis and may be generally applicable 
to diagnostic evaluation . Published by Elsevier Science Ltd. 



26/7/44 (Item 12 from file: 34) 

DIALOG (R) File 34 : SciSearch (R) Cited Ref Sci 
(c) 2004 Inst for Sci Info. All rts. reserv. 

04195359 Genuine Article*: RM514 Number of References: 29 
Title: CLASSIFYING THROUGH AN ALGEBRAIC FUZZY STRUCTURE - THE RELEVANCE 
OF THE ATTRIBUTES 

Author (s) : GISOLFI A 
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Journal: INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1995, V10, N8 (AUG) 
, P715-734 
ISSN: 0884-8173 

Language: ENGLISH Document Type: ARTICLE 

Abstract: An appropriate fuzzy structure was previously defined to 

generate appropriate classifications in a universe of discourse in 
which the characteristics of the elements are defined through 
attributes whose values are linguistic labels. The way the structure 
operates depends upon suitable operations on fuzzy attributes 
represented by fuzzy numbers. However, this classification method 
suffers from a main disadvantage, i.e., the inherent rigidity deriving 



from the circumstance that all the attributes share the same weight 
and, consequently, are taken into account completely disregarding the 
fact that usually some attributes are more relevant than others. In 
this article we analyze a suitable weighting rule allowing us to 
treat in a different way attributes whose relevance is different. The 
validity of the approach was checked by means of a case study 
concerning diets for breast-feeding women. (C) 1995 John Wiley & Sons, 
Inc . 
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Extended weights -of -evidence modelling for predictive mapping of base 
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Approaches to mineral potential mapping based on weights of evidence 
generally use binary maps, whereas, real-world geospatial data are mostly 
multi-class in nature. The consequent reclassification of multi-class maps 
into binary maps is a simplification that might result in a loss of 
information. This paper describes results of using multi-class evidential 
maps in an extended weights-of-evidence model vis-a-vis results of using 
binary evidential maps in a simple-weights-of-evidence model. The study 
area in the south-central part of Aravalli province (western India) hosts a 
number of SEDEX-type base metal deposits in Proterozoic supracrustal rocks. 
Recognition criteria for base metal deposits were represented as both 
multi-class and binary evidential maps. The known mineral deposits were 
divided into two subsets, viz., the training and the validation subsets. 
The training subset was used to calculate, for the evidential maps, the 

weights , contrasts, and posterior probabilities and their variances. 
The distributions of expected frequencies of base metal deposits estimated 

from the posterior probabilities and the observed frequencies were 

compared using standard goodness -of-fit tests to verify conditional 
independence of the input evidential maps. The posterior probabilities 

from both the models were mapped and interpreted to classify the study 
area into zones favorable, permissive, and non-permissive for base metal 
deposit occurrence. As compared to the simple weights-of-evidence model, 
the extended weights-of-evidence model results in more robust and finely 
differentiated posterior probabilities in favorable and permissive zones 
and has a better prediction rate. The results also reveal that the 
statistical properties of the weights of evidence, the contrasts, and the 
posterior probabilities are not significantly degenerated by using 
multi-class evidential maps in weights-of-evidence modelling. 
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The Fourth Text REtrieval Conference (TREC-4) , Gaithersburg, Maryland, 
November 1-3 , 1995 
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United States.; Defense Advanced Research Projects Agency (DARPA), United 
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Journal: NIST special publication, 1996 (500236) 65-72 

ISSN: 1048-776X Availability: INIST-14185; 354000044 947390040 

No. of Refs. : 10 ref . 

Document Type: P (Serial); C (Conference Proceedings) ; A (Analytic) 
Country of Publication: United States 
Language : English 

The Berkeley experiments for TREC-4 extend those of TREC-3 in three ways 
: for ad-hoc retrieval we retain the manual reformulation of the topic 
and experiment with limited query expansion based upon the assumption that 
top documents are relevant (this experiment was an interesting failure) / 
for routing retrieval we introduce a logistic regression wich assumes 
relevance weights to be only one clue among several in predicting 

probability of relevance. Finally, for Spanish retrieval we obtain the 
basic logistic regression equation to apply to the statistical 
distributions Spanish words. In addition we apply two approaches to Spanish 
stemming, one which attemps to resolve verb into a standardized form, the 
other of which eschews stemming in favor of a massive stop word list of 
variants of common words 

Copyright (c) 1997 INIST-CNRS. All rights reserved. 
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Standard error and sample size determination for estimation of 
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A method of sample size determination for estimation of 
probabilities based on a test variable is presented. Applications to 
estimation of sensitivity and specificity of medical tests are the focus of 
this research, although the methods can be applied to other areas of study 
such as engineering reliability. Examples are given for determining sample 



sizes required for the classification of patients with cutaneous lupus 
erythematosus based on the incidence of several markers. In this example, 
the test variable is the number of markers present. The methodology employs 
a weighted average of model-based and non-model-based estimates of the 

probability with the weights determined by the closeness to or the 
confidence in the given model. Formulas and charts required for determining 
sample size are provided for test variables that can be modeled by the 
binomial, Poisson, or normal distributions, i.e., for the most commonly 
encountered distributions for counting events (binomial and Poisson) and 
for measurements (normal) . However, the methods given can be applied to any 
distribution, including multivariate. Especially when relatively small 

probabilities (the rare events) are being estimated, the techniques 
provided assistance in safeguarding against undersampling brought on by 
unwarranted confidence in a test variable distribution and against 
oversampling required for high accuracy in non-model-based probability 

estimators . 

Copyright (c) 1996 INIST-CNRS. All rights reserved. 
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An online library catalogue served as a testbed to evaluate an 
interactive query expansion facility based on relevance feedback for the 
Okapi probabilistic term weighting retrieval system. The facility was 
implemented in a graphical user interface (GUI) environment using a 
game-board metaphor for the search process, and allowed searchers to select 
candidate terms extracted from relevant retrieved items to reformulate 
queries. The take-up of the interactive query expansion option was found to 
be lower, and its retrieval performance less effective, compared to 
previous tests featuring automatic query expansion. Contributory factors 
including the number, presentation and source of terms are discussed. 
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ONTENT? ? OR ASSET? ?) 

54 18360 S2(5N)(TREE? ? OR HIERARCH? OR SUBTREE? OR PYRAMID? OR TRE- 
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Process improvement in the presence of qualitative response by combining 
fuzzy sets and neural networks 

Hsieh, Kun-Lin 

Integrated Manufacturing Systems vl2n6/7 PP: 449-462 2001 
ISSN: 0957-6061 JRNL CODE: ING 
WORD COUNT: 47 51 

...TEXT: control. In some cases, the interested quality response may be a 
qualitative (or a linguistically categorical ) response. However, 
optimization of a linguistically categorical response has seldom been 
reported (Nair, 1986; Jean and Guo, 1996) . To achieve the quality 
improvement involving a linguistic category , a linguistic response is 
generally represented using the percentages, or it is classified into 
several categories . Discriminant analysis (Johnson and Wichern, 1992; 
Sharma, 1996) can be performed to recognize the relevant factors when 
analyzing a qualitative or a linguistic response problem. Taguchi (1966) 
proposed an accumulation analysis (AA) to achieve the parameter 
optimization of an ordered categorical response. In a later- work, Nair 
(1986) developed a scoring scheme (SS) , and Jean and Guo (1996) modified 
Nair's SS to propose a weighted probability scoring scheme (WPSS) for 
addressing the parameter optimization of an ordered categorical response. 
However, the implicit information among the contiguous categories can 
not be feasibly considered in their approaches. The flexibility of 
utilization will be significantly limited when applying their approaches 
into quality improvement of a qualitative or a categorical response. 

The qualitative form can also be described linguistically (Turksen and 
Willson, 1996). For instance... 



c 



22/3, K/5 (Item 5 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm (R) 

(c) 2005 ProQuest Inf o&Learning . All rts. reserv. 
01765704 04-16695 

Fuzzy MCDM based on ideal and anti -ideal concepts 
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ABSTRACT: A novel fuzzy multiple criteria decision making based on the 
concepts of ideal and anti-ideal points is presented. The concepts of 

fuzzy set theory and hierarchical structure analysis are used to develop 

a weighted suitability decision matrix to evaluate the weighted 

suitability of different alternatives versus various criteria . The 

distance of different alternatives versus positive ideal solution and 

negative ideal solution are then. . . 
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Term relevance computations and perfect retrieval performance 

Shaw, W M Jr 

Information Processing & Management v31n4 PP: 4 91-4 98 Jul 1995 
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ABSTRACT: Computing formulas for binary independent term relevance 

weights are evaluated as a function of query representations and 
retrieval expectations in the CF database. Query representations... 

...only a few documents have merit. Conventional computing equations, which 
are known to overestimate term relevance weights , are shown to produce 
mediocre results for all combinations of query representations and 
retrieval expectations. Modified computing equations, which do not 
overestimate relevance weights , produce essentially perfect retrieval 
results for both comprehensive and specific searches, when the query 
representation is complete. Probabilistic retrieval, based on BI 
assumptions and applied to simple subject descriptions of documents and 
queries, can retrieve all relevant documents and only relevant documents, 
when term relevance weights are computed accurately. 
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27-36 Winter 1991 
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.ABSTRACT: scaling technique is recommended as a useful tool for scaling 



the importance weights associated with evaluative criteria and the 
probable performance of suppliers on these criteria . Thurstone scaling 

eliminates the need for traditional point estimates of these values when 

employing weighted point supplier evaluation models. Instead, decision 

makers simply rank order the relative importances of evaluative criteria 
and the projected performance of suppliers on these criteria . Thurstone 

scaling is then used to transform the rank ordered data into appropriate 
numeric scales for inclusion in the weighted point model. The 
appropriate use of Thurstone Case V scaling can simplify the supplier 

rating process. Thurstone Case V... 
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A Comparison of Two Systems of Weighted Boolean Retrieval 

Bookstein, A. 
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...ABSTRACT: traditional Boolean retrieval system is that it does not allow 
users or indexers to indicate relative importance of topics . One of the 
2 approaches that have been taken to solve this problem is that... 

. . . introducing a weighted capability that is associated with fuzzy-set 
theory. This weighted indexing is compared to probabilistic weighting 
, in which the weights are interpreted as probabilities . A discussion 
of 3 models, in which assumptions are made about the conditional 
probabilities, is presented to better understand the relation between the 
fuzzy-set model and probabilistic indexing. The analyses show that 
when the value of the weights are restricted, all cases seem to converge. . . 
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Threshold Values and Boolean Retrieval Systems 

Buell, Duncan A.; Kraft, Donald H. 

Information Processing & Management vl7n3 PP: 127-136 1981 
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...ABSTRACT: rather than strict yes-or-no indexing decisions, a 11 fuzzy 11 
decision based on approximates. The concept of threshold values is shown 
to solve the problem inherent in relevance weights . Other possible 
evaluative mechanisms for document retrieval may be based on 
fuzzy-set-theoretic considerations. ... 
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Text Retrieval Products for Libraries . (Technology Information) (Statistical 
Data Included) 

Saffady, William 

Library Technology Reports, 36, 2, 5 
March, 2000 

DOCUMENT TYPE: Statistical Data Included ISSN: 0024-2586 

LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 35970 LINE COUNT: 03217 

... and the number of occurrences of the search term(s) within the 

document. Documents are ranked by relevance for natural language 
searches and by the number of occurrences of search terms for Boolean 
searches. Relevance ranking is based on the distribution of search 
terms within documents. To influence relevance calculations , weights 
can be used to emphasize specific search terms. Thus, a search expression 
of the form. . . 

, . .will instruct the program to count OCLC as five times more important 
than RLIN when calculating relevance . 

Any document can be selected for display with search terms 
highlighted. Documents can be viewed. . . 
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Uncertainty in Social Security Trust Fund Projections. 
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Sept, 2000 

ISSN: 0028-0283 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 8206 LINE COUNT: 00727 

... to assign output probabilities, but it is also true that Monte 

Carlo simulation (or systematic evaluation of each possible probability 
- weighted "state" for the inputs) is required (in the multivariate input 
case) in order to derive... 
? t25/3, k/35, 37 
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CAPTIONS: General information flow, (chart); COA rankings for the 7 
information states; utility scores; weights . (table) ; True conditional 
probabilities ; estimated expected scores . (table) ; Results of initial 
calculation; results of simulated decision problem, (table) 
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A personal computer-based expert system that assists users in selecting 
methods to evaluate the safety of chemical facilities has been 
developed at Battelle. 

News Release October 13, 1987 p. 1 

. . . for Hazard Identification and Evaluation) , provides managers and 
risk analysts with guidance in selecting an appropriate hazard analysis 

method for a specific plant situation and safety application. SOPHIE eases 
this problem by providing weighted suggestions for appropriate hazard 

evaluation 'procedures. These suggestions are based on factors such as 
plant design stage, plant systems characteristics... 
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08261819 Supplier Number: 69704731 (USE FORMAT 7 FOR FULLTEXT) 
Sanguine Corp. Contracts for R&D of Advanced Synthetic Red Blood Cells. 

Business Wire, p0616 
Jan 30, 2001 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 680 

the meeting and led the Selection Committee through a numerical 
selection process in which 11 criteria were scored for each proposal by 
each evaluator . Appropriate relative importance weights were then 
applied to each criterion and a figure-of -merit (FOM) was established for 
each proposal. 

He summarized the results... 
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Selecting and monitoring hotel -management companies . 

Rainsford, Peter 

Cornell Hotel & Restaurant Administration Quarterly, v35, n2, p30(6) 
April, 1994 

ISSN: 0010-8804 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 2 695 LINE COUNT: 00264 

for evaluating the performance of a management company after it has 
been retained. Again, the weighted scores are probably the best 
measurement of relative importance, as shown in Exhibit 2. (Since there 
were only 12 choices, only three forced choices were allowed, rather than 
the five forced choices used in the selection- criteria TABULAR DATA 
OMITTED section.) Using this method, three criteria clearly lead the 
others in terms of importance for evaluating hotel-management company 
performance. They. . . 
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Turning information into knowledge. 

Kress, George 

Industrial Management, v35, n2, p30(3) 
March-April, 1993 

ISSN: 0019-8471 LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT; ABSTRACT 

WORD COUNT: 3277 LINE COUNT: 00264 

...ABSTRACT: piece of information lies in filtering and the subsequent 
assessment of its veracity. Information is evaluated and weighed for 
appropriateness and accuracy before it is either accepted or rejected. A 
technique for making a more... 
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High-speed optical pick up mechanism for use in e.g. CD player/recorder, 
has laser light associated with end structure, and operable for 
transmitting light to reflective unit along optical path 

Patent Assignee: FORMAN G (FORM-I) ; JONES J (JONE-I); MA K (MAKK-I) 

Inventor: FORMAN G ; JONES J; MA K 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20040160886 Al 20040819 US 2003368170 A 20030218 200462 B 



Priority Applications {No Type Date) : US 2003368170 A 20030218 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20040160886 Al 34 G11B-007/135 

Abstract (Basic) : US 20040160886 Al 

NOVELTY - The mechanism has a pivotable structure with two ends in 
which one end of the structure is located remotely from another end of 
the structure. A reflective unit is operable for sending light to 



and/or receiving light from a surface of an optical storage medium 
(16). A laser light (12) is associated with an end of the structure, 
where the laser light is operable for sending light to a reflective 
unit along an optical path. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(A) a high speed optical read/write system 

(B) a high speed method for reading data from and/or writing data 
to an optical storage medium. 

USE - Used in CD player, DVD player/recorder for reading data 
and/or writing data to/from an optical storage medium e.g. CD and DVD. 

ADVANTAGE - The pick up mechanism allows both sides, top and 
bottom, of an optical storage medium to be read from and/or written to, 
effectively doubling the capacity of the current generation of optical 
storage media. This allows relatively new audio and video compression 
techniques to be maximized and leveraged. The pick up mechanism using a 
remote laser source provides a relatively fast seek/access times with 
relatively low track-to- track latency and relatively high capacity, 
and hence desirable for time-sensitive applications e.g. 
graphics-intensive computer gaming applications, imaging applications, 
and storage applications, where data transfer rates and seek/ access 
times are critical performance criteria . The mechanism replaces the 
relatively heavy, bulky optical read/write head with a relatively 
simple reflective unit. The reflective unit is a microstructure mirror, 
a movable micro-electromechanical systems (^MEMS % ) mirror, and hence 
allows the actuation/ tracking mechanism to move with increased speed 
relative to the surface of the optical storage medium. The mechanism 
allows both surfaces of the optical storage medium to be read from 
and/or written to and a stronger, higher laser is used. 

DESCRIPTION OF DRAWING (S) - The drawing shows a perspective view of 
a high-speed optical , read/write pick-up mechanism. 

Laser light (12) 

Optical storage medium (16) 

Semi-reflective mirror (20) 

Optical read/write pick-up mechanism (30) 

Actuation device (42) 
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Hierarchy structure analysis tool for Internet applications, has panel 
with unified display having intuitive visual representation of predictive 
features and distribution of features within hierarchy 

Patent Assignee: FORMAN G H (FORM-I) ; SUERMONDT H J (SUER-I) 

Inventor: FORMAN G H ; SUERMONDT H J 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 
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Priority Applications (No Type Date): US 200296452 A 20020312 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030174179 Al 18 G09G-005/00 

Abstract (Basic): US 20030174179 Al 

NOVELTY - The tool has a panel comprising an unified display with 
an intuitive visual representation of selected predictive features and 
distribution of the features within a classification hierarchy. The 
features are representative of a set of cases and classification 
assignments of the cases. The representation is a symbolic 
representation that visually displays coherence of the hierarchy. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) a method for displaying an organizational hierarchy structure 

(b) a computer memory comprising codes for performing the 
organizational hierarchy displaying method 

(c) a method for analyzing feature relationships in a predetermined 
structure having hierarchy of classes. 

USE - Used for gaining insight into hierarchical structures in 
Internet applications 

ADVANTAGE - The tool helps to show the strength or weakness of 
features used in hierarchical configuration and enables a user to 
intuitively visualize the places where a change would be beneficial. 
The method also indicates the 'portions where more training examples 
will be needed for effective training because of coherence and 
complexity of the learned concept. 

DESCRIPTION OF DRAWING (S) - The drawing shows a block diagram of a 
categorization process for developing a hierarchy , which may be the 
subject of the visualization process. 

Unclassified item (10) 

Featurizer item (14) 

List of features (16) 

Categorizer system knowledge database (20) 
pp; 18 DwgNo 1/6 
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Method of detecting trends in temporarily ordered set of case log 
document used in e.g. patient support organization, involves comparing 
distribution of sets of items that are established based on selected 
temporal parameters 

Patent Assignee: HSU M (HSUM-I); SUERMONDT H J (SUER-I) 

Inventor: HSU M; SUERMONDT H J 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 
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Priority Applications (No Type Date) : US 2002113318 A 20020329 
Patent Details: 
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US 20030187809 Al 7 G06F-017/00 

Abstract (Basic) : US 20030187809 Al 

NOVELTY - The method involves establishing two sets of items (42) 
based on selected temporal parameters, after assigning items to a 
topic in an item topic hierarchy of desired granularity. The 
changes in trends is detected by comparing distribution of the set of 
items. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
apparatus for detecting trends in temporarily ordered set of case log 
document . 

USE - For detecting trends in temporarily ordered set of case log 
document used in patient support organization and customer support 
organization and for detecting trends in retail sales. 

ADVANTAGE - Provides information about changes or trends which is 
used in making important decisions, reliably. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart of the 
detection process. 
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Coherence measurement method for web applications, involves evaluating 

labeled documents under one respective topic, with respect to documents 

in local environment of topic 
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Abstract (Basic) : US 20030145009 Al 

NOVELTY - The method involves receiving hierarchy of nodes such 



as topics (114) and training cases representing labeled documents 
(118) that are filed under each of the nodes. The coherence measurement 
is performed for at least one of the nodes, by evaluating the documents 
under the node with respect to the documents in the local environment 
of the node. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

( 1 ) degree of coherence measurement apparatus ; and 

(2) degree of coherence measurement system. 

USE - For web applications such as for categorizing electronic mail 
into hierarchical folders, and in organization of topic , hierarchy 
at news service. 

ADVANTAGE - Facilitates design of hierarchies in such a way as to 
tailor the designated hierarchies so that automated categorization 
program places content in accurate and efficient manner. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the coherence analyzer. 

coherence analyzer (110) 
topic hierarchy (114) 

labeled documents (118) 
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